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r- { , We present a static analysis technique for non-termination inference of logic programs. Our 

framework relies on an extension of the subsumption test, where some specific argument 
fvj I positions can be instantiated while others are generalized. We give syntactic criteria to 

p<j ■ statically identify such argument positions from the text of a program. Atomic left looping 

queries are generated bottom-up from selected subsets of the binary unfoldings of the pro- 
gram of interest. We propose a set of correct algorithms for automating the approach. Then, 
non-termination inference is tailored to attempt proofs of optimality of left termination con- 
ditions computed by a termination inference tool. An experimental evaluation is reported. 
fj I When termination and non-termination analysis produce complementary results for a logic 

procedure, then with respect to the leftmost selection rule and the language used to describe 
sets of atomic queries, each analysis is optimal and together, they induce a characterization 
of the operational behavior of the logic procedure. 
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Tj- ! 1 Introduction 

O 

"^ I Since the work of N. Lindcnstrauss on TcrmiLog [201 El j several automatic tools for termination 

O ■ checking (e.g. TALP ^) or termination inference (e.g. cTI |25[ I26| or TerminWeb ^7]) are now 

available to the logic programmer. As the halting problem is undecidable for logic programs, such 
analyzers compute sufficient termination conditions implying left termination. In most works, 

rxj I only universal left termination is considered and termination conditions rely on a language for 

t^ i describing classes of atomic queries. The search tree associated to any (concrete) query satisfying 

a termination condition is guaranteed to be finite. When terms are abstracted using the term-size 
norm, the termination conditions are (disjunctions of) conjunctions of conditions of the form "the 
i-ih argument is ground" . Let us call this language Cterm ■ 

In this report, which is based on an earlier conference paper j27| . we present the first approach 
to non-termination inference tailored to attempt proofs of optimality of termination conditions at 
verification time for pure logic programs. The aim is to ensure the existence, for each class of 
atomic queries not covered by a termination condition, of one query from this class which leads to 
an infinite search tree when such a query is proved using any standard Prolog engine. We shall first 
present an analysis which computes classes of left looping queries, where any atomic query from 
such a class is guaranteed to lead to at least one infinite derivation under the usual left-to-right 
selection rule. Intuitively, we begin by computing looping queries from recursive binary clauses 
of the form p[. . .) ^ p{. . .). Then we try to add binary clauses of the form q{. . .) <— p{. . .) to 
increase the set of looping queries. Finally by combining the result of non-termination inference 
with termination inference, for each predicate, we compute the set of modes for which the overall 
verification system has no information. 

The main contributions of this work are: 

• A new application of binary unfoldings to left loop inference. |16| introduced the binary 



unfoldings of a logic program P as a goal independent technique to transform P into a 
possibly infinite set of binary clauses, which preserves the termination property 7 while 
abstracting the standard operational semantics. We present a correct algorithm to construct 
left looping classes of atomic goals, where such classes are computed bottom-up from selected 
subsets of the binary unfoldings of the analyzed program. 

• A correct algorithm which, when combined with termination inference |23| . may detect 
optimal left termination conditions expressed in Cterm for logic programs. When termination 
and non-termination analysis produce complementary results for a logic procedure, then with 
respect to the leftmost selection rule and the language used to describe sets of atomic queries, 
each analysis is optimal and together, they induce a characterization of the operational 
behavior of the logic procedure. 

• A report on the experimental evaluation we conduct. We have fully implemented termination 
and non-termination inference for logic programs. We have run the couple of analyzers on 
a set of classical logic programs, the sizes of which range from 2 to 177 clauses. The results 
of this experiment should help the reader to appreciate the value of the approach. 

We organize the paper as follows: Section |21 presents the notations. In Section |31 we study 
loop inference for binary programs. We offer a full set of correct algorithms for non-termination 
inference in Section ^ and optimality proofs of termination conditions in Section |S1 Finally, in 
Sectional we discuss related works. The detailed proofs of the results can be found in AppcndixlBl 
at the end of the article. 

2 Preliminaries 

2.1 Functions 

Let E and F be two sets. Then, f : E —^ F denotes that / is a partial function from E to F and 
f : E :—> F denotes that / is a function from E to F. The domain of a partial function / from E 
to F is denoted by Dom{f) and is defined as: Doni{f) = {e \ e € E, /(e) exists}. Thus, if / is a 
function from E to F, then Dom{f) = E. Finally, \i f : E —^ F \s a. partial function and E' is a 
set, then f\E' is the function from Dom{f) Ci E' to F such that for each e G Dom{f) n E' , f\E' 
maps e to /(e). 

2.2 Logic Programming 

We strictly adhere to the notations, definitions, and results presented in ^. 

N denotes the set of non-negative integers and for any n ^ N, [l,n] denotes the set {1, . . . , n}. 
If n = then [1, n] = 0. 

From now on, we fix a language C of programs. We assume that C contains an infinite number 
of constant symbols. The set of relation symbols of C is 11, and we assume that each relation 
symbol p has a unique arity, denoted arity{p). TUc (resp. TBc) denotes the set of all (ground 
and non ground) terms of C (resp. atoms oi C). A query is a finite sequence of atoms Ai, . . . , An 
(where n > 0). When n = 1, we say that the query is atomic. Throughout this article, the 
variables of £ are denoted hy X,Y, Z, . . . , the constant symbols by a, 6, ... , the function symbols 
by /, 5, ft,, . . . , the relation symbols by p, g, r, . . . , the atoms by A, i3, . . . and the queries by Q, Q' , 
... or by A, B, . . . 

Let t be a term. Then Var{t) denotes the set of variables occurring in t. This notation is 
extended to atoms, queries and clauses. Let 9 :— {Xi/ti, . . . ,Xn/tn} be a substitution. We 
denote by Dom{9) the set of variables {Xi, . . . , Xn} and by Ran{9) the set of variables appearing 
in ii, . . . , i„. We define Var{9) — Dom{9) U Ran{9). Given a set of variables V , 9\V denotes the 
substitution obtained from 9 by restricting its domain to V . 



Let i be a term and be a substitution. Then, the term t0 is called an instance of t. If is a 
renaming (i.e. a substitution that is a 1-1 and onto mapping from its domain to itself), then t9 is 
called a variant of t. Finally, t is called more general than t' if t' is an instance of t. 

A logic program is a finite set of definite clauses. In program examples, we use the ISO-Prolog 
syntax. Let P be a logic program. Then lip denotes the set of relation symbols appearing in P. 
In this paper, we only focus on left derivations i.e. we only consider the leftmost selection rule. 
Consider a non-empty query B, C and a clause c. Let _ff ^ B be a variant of c variable disjoint 
with B, C and assume that B and H unify. Let 9 be an mgu of B and H. Then B, C ^=>(B, C)9 

c 

is a left derivation step with _ff <— B as its input clause. If the substitution 9 or the clause c is 
irrelevant, we drop a reference to it. 

Let Qo be a query. A maximal sequence Qo =^ Qi =^ • ■ • of left derivation steps is called a 

Cl C2 

left derivation of PU {Qo} if ci, C2, ... are clauses of P and if the standardization apart condition 
holds, i.e. each input clause used is variable disjoint from the initial query Qo a-nd from the mgu's 
and input clauses used at earlier steps. A finite left derivation may end up either with the empty 
query (then it is a successful left derivation) or with a non-empty query (then it is a failed left 
derivation). We say Qo Isft loops with respect to (w.r.t.) P if there exists an infinite left derivation 
of P U {Qo}- We write Q=>Q' if there exists a finite non-empty prefix ending at Q' of a left 
derivation of P U {Q}. 

2.3 The Binary Unfoldings of a Logic Program 

Let us present the main ideas about the binary unfoldings jl6| of a logic program, borrowed from 
[?]• This technique transforms a logic program P into a possibly infinite set of binary clauses. 
Intuitively, each generated binary clause H ^ B (where B is either an atom or the atom true 
which denotes the empty query) specifies that, with respect to the original program P, a call to 
H (or any of its instances) necessarily leads to a call to B (or its corresponding instance). 

More precisely, let Q be an atomic query. Then A is a call in a left derivation of P U {Q} if 

Q =^ A, B. We denote by callsp{Q) the set of calls which occur in the left derivations of PU {Q}. 

The specialization of the goal independent semantics for call patterns for the left-to-right selection 
rule is given as the fixpoint of an operator Tp over the domain of binary clauses, viewed modulo 
renaming. In the definition below, id denotes the set of all binary clauses of the form true <— true 
or p{Xi, . . . , Xn) ^- p{Xi, . . . , Xn) for any p G lip, where arity{p) — n. 

c:^ H ^ Bi,...,B,n<^ P, ie [l,m], 

(Hj ^- trueYz}^ G X renamed with fresh variables. 

Hi ^ B ^ X \J id renamed with fresh variables, 

i < m ^ B =/= true 

9 = mgui{Bi,...,B,),{Hi,...,H,)) 



T^{X) ^ {{H '^ B)9 



We define its powers as usual. It can be shown that the least fixpoint of this monotonic operator 
always exists and we set bin-unf {P) := lfp{Tp). Then the calls that occur in the left derivations of 
PU{Q} can be characterized as follows: callsp{Q) = {B9\H ^ B G bin-unf{P), 9 = mgu{Q, H)}. 
This last property was one of the main initial motivations of the proposed abstract semantics, en- 
abling logic programs optimizations. Similarly, bin_unf{P) gives a goal independent representation 
of the success patterns of P. 

But we can extract more information from the binary unfoldings of a program P: universal 
left termination of an atomic query Q with respect to P is identical to universal termination of 
Q with respect to bin_unf{P). Note that the selection rule is irrelevant for a binary program and 
an atomic query, as each subsequent query has at most one atom. The following result lies at the 
heart of Codish's approach to termination: 

Theorem 2.1 £W Let P be a program and Q an atomic query. Then Q left loops with respect to 
P iff Q loops with respect to bin-unf{P). 



Notice that bin_unf(P) is a possibly infinite set of binary clauses. For this reason, in the algorithms 
of Section ^ we compute only the first max iterations of Tp where max is a parameter of the 



analysis. As an immediate consequence of Theorem 12. II assume that we detect that Q loops with 
respect to a subset of the binary clauses of Tp "f i, with i £ N. Then Q loops with respect to 
bin_unf{P) hence Q left loops with respect to P. 

Example 2.2 Consider the following program P (see 177, p. 56-58): 

p(X,Z) :- p(Y,Z),q(X,Y). p(X,X) . q(a,b) . 

The binary unfoldings of P are: 

T^ T = 

T^tl = {piX, Z) ^p{Y,Z),p{X,X)^ true, q{a,b)^ true} UT^^O 

T^T2 = {p{a,b)^true,piX,Y)^q{X,Y)}UT^'\ I 

T^rS = {p{X,b)^q{X,a),p{X,Z)^q{Y,Z)}UT^^2 

T^T4 = {p{X,b)^q{Y,a)}UT^U 

T^ t 5 = T^ T 4 = bin_unf{P) 

Let Q :— p{X,b). Note that Q loops w.r.t. Tp | 1, hence it loops w.r.t. bin_unf{P). So Q left 
loops w.r.t. P. D 

3 Loop Inference Using Filters 

In this paper, we propose a mechanism that, given a logic program P, generates at verification 
time classes of atomic queries that left loop w.r.t. P. Our approach is completely based on 
the binary unfoldings of P and relies on Theorem 12.11 It consists in computing a finite subset 
BinProg of bin_unf (P) and then in inferring a set of atomic queries that loop w.r.t. BinProg. By 
Theorem 12. II these queries left loop w.r.t. P. 

Hence, we reduce the problem of inferring looping atomic queries w.r.t. a logic program to 
that of inferring looping atomic queries w.r.t. a binary program. This is why in the sequel, our 
definitions, results and discussions mainly concentrate on binary programs only. 

The central point of our method is the subsumption test, as the following lifting lemma, 
specialized for the leftmost selection rule, holds: 

Lemma 3.1 (One Step Lifting, '11) Let Q^^Qi be a left derivation step, Q' be a query that is 

c 

more general than Q and c' be a variant of c variable disjoint with Q' . Then, there exists a query 
Q'l that is more general than Qi and such that Q' ^^Q'l with input clause c' . 

c 

From this result, we derive: 

Corollary 3.2 Let c := H ^ B be a binary clause. Lf B is more general than H then H loops 
w.r.t. {c}. 

Corollary 3.3 Let c := H ^ B be a clause from a binary program BinProg. If B loops w.r.t. 
BinProg then H loops w.r.t. BinProg. 

These corollaries provide two sufficient conditions that can be used to design an incremental 
bottom- up mechanism that infers looping atomic queries. Given a binary program BinProg, it 
suffices to build the set Q of atomic queries consisting of the heads of the clauses whose body is 
more general than the head. By Corollarv l3.2l the elements of Q loop w.r.t. BinProg. Then, by 
Corollarv l3.3l the head of the clauses whose body is more general than an element of Q can safely 
been added to Q while retaining the property that every query in Q loops w.r.t. BinProg. 

Notice that using this technique, we may not detect some looping queries. In ^^, the authors 
show that there is no algorithm that, when given a right-linear binary recursive clause {i.e. a 



binary clause p{- ■ ■) <~ p(- • ■ ) such that all variables occur at most once in the body) and given 
an atomic query, always decides in a finite number of steps whether or not the resolution stops. In 
the case of a linear atomic query {i.e. an atomic query such that all variables occur at most once) 
however, the halting problem of derivations w.r.t. one binary clause is decidable [23 El El- 
It can be argued that the condition provided by Corollary 13.21 is rather weak because it fails 
at inferring looping queries in some simple cases. This is illustrated by the following example. 

Example 3.4 Let c be the clause p{X) ^^ p{f(X)). We have the infinite derivation: 

p{X) =^p{f{X))=^p{f{f{X))) =^p{f{f{f{X)))) ■ ■ ■ 



But, since the body of c is not more general than its head, Corollary \S.S\ does not allow to infer 
that p(X) loops w.r.t. {c}. D 

In this section, we distinguish a special kind of argument positions that are "neutral" for 
derivation. Our goal is to extend the relation "is more general than" by, roughly, disregarding the 
predicate arguments whose position has been identified as neutral. Doing so, we aim at inferring 
more looping queries. 

Intuitively, a set of predicate argument positions A is "Derivation Neutral" (DN for short) for 
a binary clause c when the following holds. Let Q be an atomic query and Q' be a query obtained 
by replacing by any terms the predicate arguments in Q whose position is in A. If Q =^ Qi then 

c 

Q' =^Qi where Q'l is more general than Qi up to the arguments whose position is in A. 

c 

Example 3.5 (Example \3.4\ continued) The predicate p has only one argument position, so let 
us consider A := (p f-» {!}) which distinguishes position 1 for predicate p. For any derivation 
step p{s) =^p{si) if we replace s by any term t then there exists a derivation step p{t) =^p(ti). 

c c 

Notice that p(ti) is more general than p{si) up to the argument of p. So, by the intuition described 
above, A is DN for c. Consequently, as in c the body p{f{X)) is more general than the head p{X) 
up to the argument of p which is neutral, by an extended version of Corollary \S.S\ there exists an 
infinite derivation of {c} U {p{X)}. D 

Let us give some more concrete examples of DN positions. 

Example 3.6 The second argument position of the relation symbol append in the program APPEND: 

append([] ,Ys,Ys) . 7. CI 

append([X|Xs] ,Ys, [XiZs]) :- append (Xs ,Ys ,Zs) . 7. C2 

is DN for C2. Notice that a very common programming technique called accumulator passing 
(see for instance e.g. ^281, p. 21-25) always produces DN positions. A classical example of the 
accumulator passing technique is the following program REVERSE. 

reverse (L,R) :- rev(L, [] ,R) . °L CI 

rev([] ,R,R). t C2 

rev([X|Xs] ,RO,R) : - rev(Xs, [X I RO] ,R) . 7. C3 

Concerning termination, we may ignore the second and the third argument of rev in the recursive 
clause C3 while unfolding a query with this clause. Only the first argument can stop the unfolding. 

D 

But we can be even more precise. Instead of only identifying positions that can be totaly 
disregarded as in the above examples, we can try to identify positions where we can place any 
terms for which a given condition holds. 



Example 3.7 Consider the clause c := p{f{X)) ^- p{f{f{X))). If we mean by a DN position 
a position where we can place any terms, then the argument position of p is not DN for c. This 
is because, for example, we have the derivation step p{X) ^=^p{f{f{Xi))) but if we replace X by 

c 

g{X) then there is no derivation step of {c} U {p{g{X))}. However, if we mean by a DN position 
a position where we can place any instances of f{X), then the argument position of p is DN for 

c. a 

In the sequel of the section, we define more precisely DN positions as positions where we can 
place any terms satisfying certain conditions identified by "filters". We use filters to present an 
extension of the relation "is more general than" and we propose an extended version of Corol- 
lary |^1 We offer two syntactic conditions of increasing power for easily identifying DN positions 
from mere inspection of the text of a logic program. The practical impact of such filters will be 
tackled in Sectional 

3.1 Filters 

Let us first introduce the notion of a filter. We use filters in order to distinguish atoms, some 
arguments of which satisfy a given condition. A condition upon atom arguments, i.e. terms, can 
be defined as a function in the following way. 

Definition 3.8 (Term- condition) A term-condition is a function from the set of terms TUc to 
{true, false}. 

Example 3.9 The following functions are term- conditions. 

ftrue- TUc ^ {true, false} 
t 1-^ true 

/i : TUc ^ [true, false} 

t i—t true iff t is an instance of [X\Y] 

/2 : TUc ^ [true, false} 

t 1-^ true iff t unifies with h{a,X) 

n 

Notice that a term-condition might give distinct results for two terms which are equal modulo 
renaming. For instance f2{X) — false and f2{X) = true. However, in Definition 13.121 below . we 
will only consider variant independent term-conditions. 

Definition 3.10 (Variant Independent Term- Condition) A term- condition f is variant indepen- 
dent if, for every term t, f{t) = true implies that f{t') = true for every variant t' oft. 

Example 3.11 (Examvle \'S.y[ continued) ftrue and /i are variant independent while fi is not. D 

We restrict the class of term-conditions to that of variant independent ones because we want 
to extend the relation "is more general than" so that if an atom A is linked to an atom B by 
the extended relation, then every variant of A is also linked to B (see Proposition 13.161 below) . 
This will be essential to establish the forthcoming main Proposition 13 .201 which is an extension of 
Corollarv l3.2l Now we can define what we exactly mean by a filter. 

Definition 3.12 (Filter) A filter, denoted by A, is a function from 11 such that: for each p G H, 
A(p) is a partial function from [1, arity{p)] to the set of variant independent term- conditions. 

Example 3.13 fExamvle \S.fA continued) Let p be a relation symbol whose arity equals 3. The 
filter A which maps p to the function (1 i-^ ftrue, 2 i— > /i) and any g S 11 \ {p} to () is noted 

A:={p^{l^ftrue,2^fl)). D 



3.2 Extension of the Relation "Is More General Than" 

Given a filter A, the relation "is more general than" can be extended in the following way: an atom 
A := p{- ■ • ) is A-more general than B := p{- • • ) if the "is more general than" requirement holds 
for those arguments of A whose position is not in the domain of A(p) while the other arguments 
satisiy their associated term-condition. 

Definition 3.14 (/S.-more general) Let A &e a filter and A and B be two atoms. 

• Let rj be a substitution. Then A is A-more general than B for rj if: 

A=p{si,...,Sn) 

B =p{ti,. ..,tn) 

Vi e [1, n] \ Dom{A{p)), ti = 5^77 

Vi e Dom{A{p)), A(p)(i)(s,) = true. 

• A is A-more general than B if there exists a substitution 77 s.t. A is A-more general than B 
for rj. 

An atomic query Q is A-more general than an atomic query Q' if either Q and Q' are both empty 
or Q contains the atom A, Q' contains the atom B and A is A-more general than B. 

Example 3.15 fExamvle \3.13\ continued) Let 



A 
B 

C 



= p{ b, X ,h{a,X) ) 
= p{ a , [a\b] , X ) 
= p{ a,[a\b], h{Y,b) ) 



Then, A is not A-more general than B and C because, for instance, its second argument X is 
not an instance of [^1^] as required by /i. On the other hand, B is A-more general than A for 
the .substitution {X/h{a, X)} and B is A-more general than C for the substitution {X/h{Y,b)}. 
Finally, C is not A-more general than A because h{Y,b) is not more general than h{a,X) and C 
is not A-more general than B because h{Y,b) is not more general than X. D 

As in a filter the term-conditions are variant independent, we get the following proposition. 

Proposition 3.16 Let A be a filter and A and B be two atoms. If A is A-more general than B 
then every variant of A is A-more general than B. 

The next proposition states an intuitive result: 

Proposition 3.17 Let A be a filter and A and B be two atoms. Then A is A-more general than 
B if and only if there exists a substitution r/ .such that Var{ri) C Var{A, B) and A is A-more 
general than B for rj. 

3.3 Derivation Neutral Filters: Operational Definition 

In the sequel of this paper, we focus on "derivation neutral" filters. The name "derivation neutral" 
stems from the fact that in any derivation of an atomic query Q, the arguments of Q whose position 
is distinguished by such a filter can be safely replaced by any terms satisfying the associated term- 
condition. Such a replacement does not modify the derivation process. 

Definition 3.18 (Derivation Neutral) Let A be a filter and c be a binary clause. We say that A 
is DN for c if for each derivation step Q=^Qi where Q is an atomic query, for each Q' that is 

c 

A-more general than Q and for each variant c' of c variable disjoint with Q' , there exists a query 
Q'l that is A-more general than Qi and .such that Q' ^=>Q[ with input clause c' . This definition 

c 

is extended to binary programs: A is DN for P if it is DN for each clause of P. 



Example 3.19 The following examples illustrate the previous definition. 

• Let us reconsider the program APPEND from Examvle \3. 61 with the term- condition ftrue defined 
in Exaravle \3. ffl and the filter A :— {append ^^ (2 ^^ ftrue))- A is DN for C2. However, A 
is not DN for APPEND because it is not DN for CI. 

• Consider the following clause: 

merge([X|Xs] , [YlYs] , [XiZs]) :- merge(Xs, [Y| Ys] ,Zs) . 

The filter {merge ^^ (2 ^^ A)); where the term- condition /i is defined in Examvle \3.iA is 
DN for this clause. 

In the next subsection, we present some syntactic criteria for identifying correct DN filters. For 
proving that the above filters are indeed DN, we will just check that they actually fulfill these 
syntactic criteria that are sufficient conditions. D 

Derivation neutral filters lead to the following extended version of Corollarv l3.2l ftake A such 
that for any p, A{p) is a function whose domain is empty): 

Proposition 3.20 Let c :— H <— B be a binary clause and IS. be a filter that is DN for c. If B is 

/S.-more general than H then H loops w.r.t. {c}. 

We point out that the above results remain valid when the program under consideration is 
restricted to its set of clauses used in the derivation steps. For instance, although the filter A of 
Example 13. 191 is not DN for APPEND, it will help us to construct queries which loop w.r.t. C2. Such 
queries also loop w.r.t. APPEND. 

Notice that lifting lemmas are used in the literature to prove completeness of SLD-resolution. 
As Definition 13. 181 corresponds to an extended version of the One Step Lifting Lemma IXTl it may 
be worth to investigate its consequences from the model theoretic point of view. 

First of all, a filter may be used to "expand" atoms by replacing every argument whose position 
is distinguished by any term that satisfies the associated term-condition. 

Definition 3.21 Let A be a filter and A be an atom. The expansion of A w.r.t. A, denoted A^/^, 
is the set defined as 

def 

yl|A — {^} U {B G TB^^ I B is A-more general than A for e} 

where e denotes the empty substitution. 

Notice that in this definition, we do not necessary have the inclusion 

{A} C {S e TBc I B is A-more general than A for e} . 

For instance, suppose that A := p{f{X)) and that A maps p to the function (1 ^^ /) where / is 
the term-condition mapping any term t to true iff t is an instance of g{X). Then 

{B G TBc I B is A-more general than A} — {p{t) | t is an instance oi g{X)} 

with A ^ {p{t) I t is an instance oi g{Xy}. 

Term interpretations in the context of logic programming were first introduced in [H] and further 
investigated in ^J and then in j22|. A term interpretation for C is identified with a (possibly 
empty) subset of the term base TBc. So, as for atoms, a term interpretation can be expanded by 
a filter. 

Definition 3.22 Let A be a filter and I be a term interpretation for C. Then /|a is the term 
interpretation for C defined as: 

I\A — \J ^TA • 
Aei 



For any logic program P, we denote by C{P) its least term model. 

Theorem 3.23 Let P be a binary program and A be a DN filter for P. Then C(P)|a = C{P). 

Proof. The inclusion C{P) C C(P)|a is straightforward so let us concentrate on the other one 
i.e. C(P)|A ^ C(P). Let A' e C(P)|a- Then there exists A e C{P) such that A' e A^a- A well 
known result states: 

C{P) = {B e TBc I there exists a successful derivation of P U {B}} (1) 

Consequently, there exists a successful derivation £_ of P{J{A}. Therefore, by successively applying 
Definition l3.18l to each step of ^, one construct a successful derivation of A'. So by (^1 A' G C{P). 

3.4 Some Particular DN Filters 

In this section, we provide two sufficient syntactic conditions for identifying DN filters. 

3.4.1 DN Sets of Positions 

The first instance we consider corresponds to filters, the associated term-conditions of which are 
all equal to ftme (see Example 13. 9|l . Within such a context, as the term-conditions are fixed, each 
filter A is uniquely determined by the domains of the partial functions A(p) for p G 11. Hence the 
following definition. 

Definition 3.24 (Set of Positions) A set of positions, denoted by t, is a function from H to 2^ 
such that: for each p G 11, t{p) is a subset of [1, arity{p)]. 

Example 3.25 Let append and appends be two relation symbols. Assume that arity(append) = 3 
and arity (appends) = 4. Then r := ( append i~->- {2}, appends i-^ {2,3,4} ) is a set of positions. 

D 

Not surprisingly, the filter that is generated by a set of positions is defined as follows. 

Definition 3.26 (Associated Filter) Let t be a set of positions and ftme be the term- condition 
defined in Examvle \S.9l The filter A[r] defined as: 

for each p G 11, A[t](p) is the function from t{p) to {ftrue\ 

is called the filter associated to r. 

Example 3.27 (Examvle \S.25\ continued) The filter associated to r is 

A[t] := {append ^ {2 ^ ftme), appends l-^ (2 l-^ /trne, 3 ^ ftme, 4 *-> ftme))- 

n 

Now we define a particular kind of sets of positions. These are named after "DN" because, as 
stated by Theorem 13.301 below, they generate DN filters. 

Definition 3.28 (DN Set of Positions) Let t be a set of positions. We say that r is DN for a 
binary clause p{si, . . . , s„) ^ g(ii, . . . , i,„) ifi 

{Si is a variable 
Si occurs only once in p{si, . . . , s„) 
Vj G [1,to], s, G Var{tj) ^j€ T{q) . 

A set of positions is DN for a binary program P if it is DN for each clause of P. 



The intuition of Definition 13.281 is the following. If for instance we have a clause c := 
p{X, Y, f{Z)) <— p{g{Y, Z), X, Z) then in the first two positions of p we can put any terms and get 
a derivation step w.r.t. c because the first two arguments of the head of c are variables that appear 
exactly once in the head. Moreover, X and Y of the head reappear in the body but again only in 
the first two positions of p. So, if we have a derivation step p(si, S2, S3) =^piti, ^2, ^3), we can re- 

c 

place si and S2 by any terms s[ and S2 and get another derivation step p{s[, s'2, S3) =^p{t'i, ^27 ^3) 

c 

where tg is the same as ^3 up to variable names. 

Example 3.29 (Examvle \3.25\ continued) r is DN for the program: 

append([X|Xs] ,Ys, [XiZs]) :- append(Xs ,Ys ,Zs) . 
appends (Xs ,Ys ,Zs ,Ts) :- append(Xs, Ys,Us) . 

which is a subset of the binary unfoldings of the program APPENDS: 

append ([],Ys,Ys). 

append([X|Xs] ,Ys, [XiZs]) :- append(Xs ,Ys ,Zs) . 

appends (Xs ,Ys,Zs,Ts) :- append(Xs, Ys,Us) , append(Us ,Zs ,Ts) . 

n 

DN sets of positions generate DN filters. 

Theorem 3.30 Let t be a DN set of positions for a binary program P. Then A[t] is DN for P. 

Proof. As we will see in Section |^^21 this theorem is a particular case of Theorem 13.391 

Notice that the set of DN sets of positions of any binary program P is not empty because, by 
Definition 13.281 tq := (p 1— > | p g 11} is DN for P. Moreover, an atom A is A[To]-more general 
than an atom _B iff A is more general than B. 

3.4.2 DN Sets of Positions with Associated Terms 

Now we consider another instance of Definition 13.181 As we will see, it is more general than the 
previous one. It corresponds to filters whose associated term-conditions have all the form "is an 
instance of t" where i is a term that uniquely determines the term-condition. Notice that such 
term-conditions are variant independent, so it makes sense to consider such filters. Hence the 
following definition. 

Definition 3.31 (Sets of Positions with Associated Terms) A set of positions with associated 
terms, denoted by r+, is a function from 11 such that: for each p G IT, T+(p) is a partial function 
from [l^arityij))] to TUc- 

Example 3.32 Let p and q be two relation symbols whose arity is 2. Then 

T+:= {p^{2^X),q^{2^g{X))) 
is a set of positions with associated terms. D 

The filter that is generated by a set of positions with associated terms is defined as follows. 

Definition 3.33 (Associated Filter) Let t+ be a set of positions with associated terms. The filter 
associated to t+, denoted by A[r+], is defined as: for each p G H, A[t+](p) is the function 

Dom{T^{p)) >—> The set of term- conditions 

I TUc *~* {true, false} 

I t H^ true iff t is an instance of T^{p){i) 
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Example 3.34 fExamvle VJ.'J'/A continued] The filter associated to r+ is 

A[r+]:= (p^(2^/i>, g^(2^/2>> 

where 

/i : TUc ^ {true, false} 

t ^^ true iff t is an instance of X 

/2 : TUc ^-^ {true, false} 

t I— > true iff t is an instance of g{X) 

n 

As for sets of positions, we define a special kind of sets of positions with associated terms. 

Definition 3.35 (DN Sets of Positions with Associated Terms) Let r+ he a set of positions with 
associated terms. We say that t+ is DN for a binary clause p{si, . . . , s„) *— 9(^1, • ■ • , tm) if these 
conditions hold: 

• (DNl) Vi e Dom{T+{p)), Wj 6 [l,n] \ {i}: Var{s^) n Var{sj) = 0, 

• (DN2) \f(i H^ Ui) G T+(p); Si is more general than Ui, 
(DN3) V(j 1-^ Uj) G T+(g); tj is an instance ofuj, 



• 



• (DN4) Vi G Dom{T+{p)), Vj ^ Do7n{T+{q)): Var{si) n Var{tj) 



A set of positions with associated terms is DN for a binary program P if it is DN for each clause 
of P. 

This definition says that any si where i is in the domain of r^ija) {i.e. position i is distinguished 
by T'^): (DNl) does not share its variables with the other arguments of the head, (DN2) is more 
general than the term Ui that i is mapped to by t+ [p) , (DN4) distributes its variables to some tj 
such that j is in the domain of T^{q) {i.e. position j is distinguished by r+). Moreover, (DN3) 
says that any tj , where j is distinguished by t'^ , is such that tj is an instance of the term Uj that 
j is mapped to by T+(g). 

Example 3.36 fExamvle \S.S2\ continued) t^ is DN for the following program: 

p(f (X),Y) :- q(X,g(X)). 
q(a,g(X)) :- q(a,g(b)). 



The preceding notion is closed under renaming: 

Proposition 3.37 Let c be a binary clause and r+ be a set of positions with associated terms that 
is DN for c. Then t+ is DN for every variant of c. 

Notice that a set of positions is a particular set of positions with associated terms in the 
following sense. 

Proposition 3.38 Let t be a set of positions and X be a variable. Let t^ be the set of positions 
with associated terms defined as: for each p G H, r+(p) :— ( t(j)) ^-> {X} ). Then, the following 
holds. 

1. An atom A is A[r]-more general than an atom B iff A is A[r+]-more general than B. 

2. For any binary clause c, t is DN for c iff t^ is DN for c. 
Proof. A proof follows from these remarks. 
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Item n is a direct consequence of the definition of "A-more general" (see Definition I3.f4|l 
and the definition of the filter associated to a set of positions (see Definition I3.2()|) and to a 
set of positions with associated terms (see Definition I3.33|l . 

Item 121 is a direct consequence of the definition of DN sets of positions (see Definition 
and DN sets of positions with associated terms (see Definition 13. 35|) . 



The sets of positions with associated terms of Definition 13.351 were named after "DN" because 
of the following result. 

Theorem 3.39 Let P be a binary program and r+ be a set of positions with associated terms that 
is DN for P. Then A[r+] is DN for P. 

As in the case of sets of positions, the set of DN sets of positions with associated terms of any 
binary program P is not empty because, by Definition 13.351 t^ := {p ^^ Q \ p € 11) is DN for 
P. Moreover, an atom A is A[TQ^]-more general than an atom i? iff ^ is more general than B. 
Finally, in Appendix 1X1 we give an incremental algorithm (see Section IT!^ that computes a DN 
set of positions with associated terms. Its correctness proof is also presented. 

3.5 Examples 

This section presents some examples where we use filters obtained from DN sets of positions and 
DN sets of positions with associated terms to infer looping queries. As the filters we use in each 
case are not "empty" {i.e. are not obtained from tq or Tq ), we are able to compute more looping 
queries than using the classical subsumption test. 

Example 3.40 Consider the program APPEND that we introduced in Examvle \S.f\ Every infinite 
derivation w.r.t. APPEND starting from an atomic query only uses the non-unit clause C2. There- 
fore, as we aim at inferring looping atomic queries w.r.t. APPEND, we only focus on C2 in the 
sequel of this example. 

As in C2 the body, which is append{Xs, Ys,Zs), is more general than the head, which is 
append{[X\Xs], Ys,[X\Zs]), by Corollary \'3.'^ we have that the query append[[X\Xs\, Ys,[X\Zs\) 
loops w.r.t. {C2}. Consequently, by the One Step Lifting Lemma VS.lX each query that is more 
general than append{[X\Xs], Ys, [X\Zs]) also loops w.r.t. {C2}. 

But we can be more precise than that. According to Definition \3.2<^ t := ( append ^^ {2} ) 
is a DN set of positions for {C2}. The filter associated to r (see Definition \3.26l) is A[t] :— 
{ append h^ (2 i— > ftrue) )• By Theorem \S.H(A A[t] is a DN filter for {C2}. Consequently, by 
DeRnition \S.18[ each query that is A[r]-more general than append {[X\Xs], Ys, [X\Zs]) loops w.r.t. 
{C2}. This means that 

,,, , , , „n ^2 is any term and 

append(ti,t2,t^) ^ 1 B r , , . , ,, n^ii^i n^ir^i 

11,1:3 IS more general than [A \Xs\, [A \Zs\ 

is a set of atomic queries that loop w.r.t. {C2}, hence w.r.t. APPEND. This set includes the 'well- 
typed' query append{As, [], Bs). D 

Example 3.41 Consider the program REVERSE that was introduced in Example \3.6\ As in the 
example above, in order to infer looping atomic queries w.r.t. REVERSE, we only focus on the 
non-unit clauses CI and C3 in the sequel of this example. More precisely, we process the relation 
symbols of the program in a bottom-up way, so we start the study with clause C3 and end it with 
clause CI. 

According to Definition \3.2t^ t :— ( rev i~~+ {2,3} } is a DN set of positions for {C3}. The 
filter associated to t (see Definition \3.2b]) is A[r] := ( rev i-^ (2 (-^ ftrue, 3 1-^ ftrue) )■ By 
Theorem WM A[t] is DN for {C3}. As rev{Xs, [X\Ro\,R) (the body of C3) is A[T]-more general 
than rev([X\Xs], Rg, R) (the head of C3), by Provosition \3 .2l% we get that rev(\X\Xs], Rg, R) loops 
w.r.t. {C3}. Notice that, unlike the example above, here we do not get this result from Corollary\. 
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as rev{Xs,[X\Ro], R) is not more general than rev{[X\Xs], Eg, R). Finally, as A[r] is DN for 
{C3}, by Definition \3.1t^ we get that each query that is A[r]-more general than rev{[X\Xs\, Eg, E) 
loops w.r.t. {C3}, hence w.r.t. REVERSE. This means that 



Q:= {rev{tiMM) e TBc 



t2 and ^3 are any terms and 
ti is more general than [X\Xs] 

is a set of atomic queries that loop w.r.t. REVERSE. This set includes the 'well-typed' query 
reviAs, [],[]). 

Now, consider clause CI. As rev{L, [],i?) (its body) is an element of Q, then rev{L, [],i?) loops 
w.r.t. {C3}, hence w.r.t. {C1,C3}. Consequently, by Corollarv Vd.iA reverse{L, R) (the head of CI) 
loops w.r.t. {C1,C3}. Let r' := ( rev i-^ {2,3}, reverse i-^ {2} ). By Definition \3.28[ t' is DN for 
{C1,C3}, so A[r'] is DN for {C1,C3}. Consequently, each query that is A[r']-more general than 
reverse{L,R) also loops w.r.t. {C1,C3} hence w.r.t. REVERSE. This means that 

<yreverse{X,t) G TBc \ X is a variable and t is any termj 

is a set of atomic queries that loop w.r.t. REVERSE. This set includes the 'well-typed' query 

reverse{As, []). D 

Example 3.42 Consider the two recursive clauses of the program MERGE where we have removed 
the inequalities: 

merge ( [X I Xs] , [Y I Ys] , [X I Zs] ) : - merge (Xs , [Y | Ys] , Zs) . 7. C3 
merge([X|Xs] , [YlYs] , [YiZs]) : - merge( [X I Xs] , Ys.Zs) . 7. C4 

Every set of positions r that is DN for {C3} is such that T{merge) — because each argument 
of the head of C3 is not a variable (see Definition \S.2^) . Hence, using Provosition \S.2(% with 
a filter obtained from a DN set of positions leads to the same results as using Corollary \ 
as merge{Xs,[Y\Ys\, Zs) is more general than merge{[X\Xs],[Y\Ys\,[X\Zs]), by Corollary] 
merge([X\Xs],[Y\Ys],[X\Zs]) loops w.r.t. {C3}. 5*0, by the One Step Lifting Lemma iS.ll each 
query that is more general than merge{[X\Xs], [Y\ Y.s], [X\Zs]) also loops w.r.t. {C3}, hence w.r.t. 
MERGE. 

But we can be more precise than that. According to Definition l,V..¥5[ r^ := {merge ^-> (2 i~^ 
[Y\ Y.s]) ) is a set of positions with associated terms that is DN for {C3}. Hence, by Theorem \8.3iA 
the associated filter A[t^] (see Definition \5.33\) is DN for |C3|. So, by Definition \3. T^ each query 
that is A[T^]-more general than merge{[X\X.s], [Y\ Ys], [X\Zs]) loops w.r.t. {C3}. This means that 



merge{ti,t2,t3) e TBc 



t2 is any instance of [Y\ Ys] and 

ti,t3 is more general than [X\Xs], [X\Zs] 



is a set of atomic queries that loop w.r.t. MERGE. Notice that this set includes the 'well-typed' query 
merge{As, [Q],Bs). Finally, let us turn to clause C4. Eeasoning exactly as above with the set of 
positions with associated terms {merge i-^ (1 i-^ [X|Xs]) ) which is DN for {C4}, we conclude that: 



I 



merge{ti,t2,t3) G TBc 



ti is any instance of [X\X.s] and 

t2,t3 is more general than [Y\Ys], [Y\Z.s] 



is a set of atomic queries that loop w.r.t. MERGE. Notice that this set includes the 'well-typed' query 
merge{[0], As, Bs). D 

4 Algorithms 

We have designed a set of correct algorithms for full automation of non-termination analysis of 
logic programs. These algorithms are given in Appendix IXI with their correctness proofs. In this 
section, we present the intuitions and conceptual definitions underlying our approach. 
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4.1 Loop Dictionaries 

Our technique is based on a data structure called dictionary which is a set of pairs {BinSeq,T^) 
where BinSeq is a finite ordered sequence of binary clauses and t+ is a set of positions with 
associated terms. In the sequel, we use the list notation of Prolog and a special kind of dictionaries 
that we define as follows. 

Definition 4.1 (Looping Pair, Loop Dictionary) A pair {BinSeq, t'^), where the list BinSeq is a 
finite ordered sequence of binary clauses and t+ is a set of positions with associated terms, is a 
looping pair if t+ is DN for BinSeq and: 

• either BinSeq = [iJ «— _B] and B is A[T+]-more general than H , 

• or BinSeq = [H ^ B, Hi «— Bi \ BinSeqi] and there exists a set of positions with associated 
terms r^ such that {[Hi ^- Bi \ BinSeq-^, t^) is a looping pair and B is /^[Ti]-more general 
than Hi . 

A loop dictionary is a finite set of looping pairs. 

Example 4.2 The pair {BinSeq := [Hi ^ Bi,H2 ^ B2, H3 ^ B3], ri+) where 

Hi ^ Bi := r{X) ^ q{X, f{,f{X))) 
H2 ^ B2 := q{X, f(Y)) ^ p{f{X), a) 
H3 ^ Bs :- p{f{g{X)),a) ^ p{X, a) 

and T^ := (p h^ (2 h^ a), q 1-^ {2 h^ f{X))) is a looping pair: 

• Let T3 := (p H^ (2 H^ a)). Then T3 is a DN set of positions with associated terms for 
[H3 <— B3]. Moreover, B3 is A{T^)-more general than H3. Consequently, {[H3 <— ,63], T3) 
is a looping pair. 

• Notice that B2 is A{T![)-more general than H^. Now, let T2 '■= t^ ■ Then Tj is DN for 
[H2 ^ B2,H3 ^ B3]. So, {[H2 <— ^2, -ffa <" B3],T2) is a looping pair. 

• Finally, notice that Bi is A{T2)-more general than H2. As t^ is DN for BinSeq, we conclude 
that {BinSeq, T^) is a looping pair. D 

A looping pair immediately provides an atomic looping query. It suffices to take the head of 
the first clause of the binary program of the pair: 

Proposition 4.3 Let {[H ^ B\BinSeq],T^) be a looping pair. Then H loops with respect to 
[H ^ B[BinSeq]. 

Proof. By induction on the length of BinSeq using Proposition 13.201 Corollary 13.31 and Theo- 
rem l3.39l So, a looping pair denotes a proof outline for establishing that H left loops. Moreover, 
looping pairs can be built incrementally in a simple way as described below. 

4.2 Computing a Loop Dictionary 

Given a logic program P and a positive integer max, the function inf er_loop_dict from Ap- 
pendix ^ first computes Tp f max (the first max iterations of the operator Tp), which is a 
finite subset of bin_unf {P) . Then, using the clauses of Tp | max, it incrementally builds a loop 
dictionary Diet as follows. 

At start. Diet is set to 0. Then, for each clause H ^ B in Tp t max, the following actions 
are performed. 

• inf er_loop_dict tries to extract from H <— B the most simple form of a looping pair: it 
computes a set of positions with associated terms r"*" that is DN for H ^ B, then it tests if 
B is A[T+]-more general than H. If so, the looping pair {[H ^ B],t^) is added to Diet. 
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• inf er_loop_dict tries to combine H ^ B to some looping pairs that have already been 
added to Diet in order to build other looping pairs. For each {[Hi ^- Bi\BinSeqi],Ti) in 
Diet, if B is A[T]^]-more general than Hi, then a set of positions with associated terms 
r+ that is DN for [H ^ B,Hi ^- Bi\BinSeqi] is computed and the looping pair {[H ^- 
B,Hi ^- Bi\BinSeqi],T^) is added to Diet. 

Notice that in the second step above, we compute t+ that is DN for [H ^ B,Hi ^- Bi\BinSeqi]. 
As we already hold t^ that is DN for [Hi ^- Bi\BinSeqi], it is more interesting, for efficiency 
reasons, to compute r+ from Ti instead of starting from the ground. Indeed, starting from Ti , 
one uses the information stored in Ti about the program [Hi <— Bi\BinSeqi], which may speed 
up the computation substantially. This is why we have designed a function dna that takes two 
arguments as input, a binary program BinProg and a set of positions with associated terms t+. It 
computes a set of positions with associated terms that is DN for BinProg and that refines r"*" . On 
the other hand, the function unit_loop calls dna with t+^j. which is the initial set of positions with 
associated terms defined as follows: Dom,{T^^,j.{p)) — [1, arity{p)] for eachp G 11 and T+^^(p)(i) is 
a variable for each i G [1, arity{p)]. 

Example 4.4 Consider the program APPENDS 

appends (Xs,Ys,Zs, Us) :- append(Xs, Ys, Vs) , append(Vs ,Zs ,Us) . 

augmented with the APPEND program. The set T^ppends T 2 ineludes: 

append([A|B] ,C, [AID]) : - append(B,C,D) . % BCl 

appends ( A, B,C,D) :- append ( A, B, E) . 7. BC2 

appendSCD ,A,B,C) :- append ( A, B, C) . % BC3 

From elause BCl we get the looping pair (BinSeqi,T^) where 

BinSeq^ = [append{[Xi\X2], X3,[Xi[X4]) ^- append{X2, X3, X^jj 

and Ti{append) = (2 ^^ -^^3). From this pair and the clause BC2, we get the looping pair 
{BinSeq2,T2) where: 

BinSeq2 = [ appends {Xi, X2, X^, X4) ^- append{Xi, X2, X^), 

append{[Xi\X2],X3, [X1IX4]) ^ append{X2, X3, X4) ] 

and T2 (append) = (2 h^ X^) and T2 (appends) = (2 h^ ^2j3 h^ X^,4 h^ X4). 

Finally, from (BinSeqi,T^) and BC3, we get the looping pair (BinSeq^,T^) where: 

BinSeq^ — [ appends ([],Xi,X2,X^) ^- append (Xi ,X2,X^), 

append([Xi\X2], X3, [Xi[Xi]) ^ append(X2, X3, X^) ] 

and T^ (append) = (2 h^ X3) and t^ (appends) = (3 h^ X2) . □ 

Example 4.5 Consider the program PERMUTE: 

deleteCX, [X|Xs] ,Xs). 

delete(Y, [XlXs] , [XlYs]) :- delete(Y,Xs ,Ys) . 

permute ( [],[]). 

permute([X|Xs] , [YlYs]) :- delete(Y, [X I Xs] ,Zs) , permute(Zs ,Ys) . 

The set Tp^nfmYE t 1 includes: 
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delete(B, [CID] , [ClE]) :- delete (B,D,E) . 7. BCl 

permute ( [B I C] , [D I E] ) :- delete (D, [B I C] ,F) . "/. BC2 

From clause BCl we get the looping pair {BinSeq^,T^) where 

BinSeq^ = [delete{Xi, [X2IX3], [^21^4]) ^ delete{Xi, X3, X4)] 

and T^ (delete) = (1 h^ ^i). From this pair and BC2, we get the looping pair {BinSeq2, Tj ) where: 

BinSeq^ = [ permute{[Xi\X2], [^3|^4]) ^ delete{X3, [Xi\X2],X5), 
deleteiXi, [X2IX3], [X2IX4]) ^ delete{Xi, X3,Xi) ] 

and T2 (delete) = (1 h^ Xi) and T2 (permute) = (2 i-^ [X3IX4]). D 

4.3 Looping Conditions 

One of the main purposes of this article is the inference of classes of atomic queries that left loop 
w.r.t. a given logic program. Classes of atomic queries we consider are defined by pairs (A,t^) 
where A is an atom and r+ is a set of positions with associated terms. Such a pair denotes the 
set of queries ^fr+i the definition of which is similar to that of the expansion of an atom, see 
Definition IXni 

Definition 4.6 Let A be an atom and r+ be a set of positions with associated terms. Then A'^^+ 
denotes the class of atomic queries defined as: 

A|^+ =^ {A} U{B e TBc I B IS A[r+]-more general than A} . 

Once each element of v4|t-+ left loops w.r.t. a logic program, we get what we call a looping 
condition for that program: 

Definition 4.7 (Looping Condition) Let P be a logic program. A looping condition for P is a 
pair (A^T^) such that each element of Ai^t-+ left loops w.r.t. P. 

The function inf er_loop_cond takes as arguments a logic program P and a non-negative 
integer max. Calling inf er_loop_dict(P, rreaa;), it first computes a loop dictionary Diet. Then, 
it computes from Diet looping conditions for P as follows. The function extracts the pair (H,t^) 
from each element ([H ^- B\BinSeq],T~^) of Diet. By Proposition 14.31 H loops w.r.t. [H ^~ 
B\BinSeq\. As r+, hence A[r+], is DN for [H ^ B\BinSeq], by Definition EIHl each element of 
i/|T-+ loops w.r.t. [H ^- B\BinSeq]. Finally, as [H ^- B\BinSeq] C Tp | max C bin_unf(P), by 
Theorem 12. II each element of H^.^+ left loops w.r.t. P. 

Example 4.8 (Example \4-4\ continued) From each looping pair we have infered, we get the fol- 
lowing information. 

• ( append ([Xi I X2], X3, [Xi 1X4]), Tj^) is a looping condition. So, each query append(ti,t2,t3), 
where [X1IX2] = tirj and [X1IX4] = t^rj for a substitution rj and t2 is an instance of X3 
(because t^ (append) (2) = X3), left loops w.r.t. APPENDS. In other words, each query 
append(ti,t2,t3), where [XilA'2] = tirj and [X1IX4] = t^rj for a substitution rj and ^2 'is 
any term, left loops w.r.t. APPENDS. 

• (append3(Xi, X2, X3, X4),T2) is a looping condition. As we have T2 (appends) (2) = X2, 
T2 (appends) (3) — X^ and T2 (appends) (4) = X4, this means that each query of form 
appends (xi,t2,t3,t4), where t2, is and t^ are any terms, left loops w.r.t. APPENDS. 

• (appends (W,Xi,X2,X3),T^) is a looping condition. So, as t^ (appends) (3) = X2, this 
means that each query of form appendS(W, Xi,t, X3), where t is any term, left loops w.r.t. 
APPENDS. D 
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Example 4.9 (Example \4-5\ continued) From each looping pair we have infered, we get the fol- 
lowing information. 

• {delete{Xi, [X2IX3], [X2IX4]), t^) is a looping condition. As T^{delete){l) = Xi, this means 
that each query of form delete{ti,t2,t^), where ii is any term and [X2IX3] = ^2'? o.'^^d 
[X2IX4] — t^rj for a substitution rj, left loops w.r.t. PERMUTE. 



• 



{permute{[X i\X2\, [X^\X4\)^T2) is a looping condition. As T2 {permute) [2) — [X3IX4], this 
means that each query of form permute{ti,t2), where ti is more general than [X1IX2] and 
t2 is any instance of [X3IX4], left loops w.r.t. PERMUTE. D 

5 An Application: Proving Optimality of Termination Con- 
ditions 

|2b| presents a tool for inferring termination conditions that are expressed as multi- modes, i.e. 
as disjunctions of conjunctions of propositions of form "the i-th argument is ground" . In this 
section, we describe an algorithm that attempts proofs of optimality of such conditions using the 
algorithms for non-termination inference of the previous section. 

5.1 Optimal Terminating Multi-modes 

Let P be a logic program and p G lip be a relation symbol, with arity{p) = n. First, we describe 
the language we use for abstracting sets of atomic queries: 

Definition 5.1 (Mode) A mode mp for p is a subset of [1,n\, and denotes the following set of 
atomic goals: [rup] = {p(ii, . . . ,t„) G TBc \ Vi G rup Var{ti) — 0}. The set of all modes for p, 
i.e. 2[^'"'l, is denoted modes{p). 

Note that if nip — then [rap] — {p(ii, . . . ,i„) G TBc}. Since a logic procedure may have 
multiple uses, we generalize: 

Definition 5.2 (Multi-mode) A multi-mode Mp for p is a finite set of modes for p and denotes 
the following set of atomic queries: [Mp] ~ UmeM [m] • 

Note that if Mp = 0, then [Alp] ~ 0. Now we can define what we mean by terminating and 
looping multi-modes: 

Definition 5.3 (Terminating mode, terminating multi-mode) A terminating mode nip for p is a 
mode for p such that any query in [nip] left terminates w.r.t. P. A terminating multi-mode TMp 
for p is a finite set of terminating modes for p. 

Definition 5.4 (Looping mode, looping multi-mode) A looping mode nip for p is a mode for p 
such that there exists a query in [nip] which left loops w.r.t. P. A looping multi-mode LMp for p 
is a finite set of looping modes for p. 

As left termination is instantiation-closed, any mode that is "below" (less general than) a 
terminating mode is also a terminating mode. Similarly, as left looping is generalization-closed, 
any mode that is "above" (more general than) a looping mode is also a looping mode. Let us be 
more precise: 

Definition 5.5 (Less-general, morc-general) Let Mp he a multi-mode for the relation symbol p. 
We set: 

less -general (Mp) = {ni G modes{p) \ Elm' G Mp [m] C [m']} 
more-general{Mp) ^ {m G modes{p) \ 3m' G Mp [m'] C [m]} 



17 



loopingjnodes(L, p): 

in: L: a finite set of looping conditions 
p: a predicate symbol 



out 

1 
2 
3 

4 
5 



a looping multi-mode for p 



LMp 



for each (p{ti, ...,<„), r"*") G L do 

mp := Dom{T+{p)) U {i e [l,n\ \ Var{U) = 0} 

LMp := LMp U {mp} 
return LM^ 



Figure 1: 

Wc are now equipped to present a definition of optimality for terminating multi-modes: 

Definition 5.6 (Optimal terminating multi-mode) A terminating multi-mode TMp for p is opti- 
mal if there exists a looping multi-mode LMp verifying: 

modes{p) = less-general{TM p) U more-general{LM p) 

Otherwise stated, given a terminating multi-mode TMp, if each mode which is not less general 
than a mode of TMp is a looping mode, then TMp characterizes the operational behavior of p 
w.r.t. left termination and our language for defining sets of queries. 

Example 5.7 Consider the program APPEND. A well-known terminating multi-mode is the set 
TM append = {{llil^}}- Indeed, any query of the form append(t,Ys,Zs) or append(Xs,Ys,t), 
where t is a ground term (i.e. .such that Var{t) — 0), left terminates. We have: 

less _general{TM append) = {{!}, {3}, {1, 2}, {1, 3}, {2, 3}, {1, 2, 3}} 

On the other hand, append(Xs, [] ,Zs) left loops. Hence LM append — {{2}} is a looping condition 
and more _general{LM append) — {0:{2}}- 

Since modes {append) — less -general{TM append) U more-general{LM append), we conclude that 
the terminating multi-mode TM append is optimal. D 

5.2 Algorithms 

Suppose we hold a finite set L of looping conditions for P. Then, each element {p{ti, . . . , tn), t"*") 
of L provides an obvious looping mode for p: it suffices to take {i € [1,«] | Var{ti) = 0}. But 
actually, we can extract more information from L. Let p{ti, . . . , f^) be an atom such that: 

• for each (i h-> Uj) g t+(p), t^ is a ground instance of Ui, 

• for each i in [1, n] \ Dom{T^{p)), t[ = ti. 

Then, p{t'i, . . . ,t'^) belongs to p{ti, . . . ,tn)ir+, hence it left loops w.r.t. P. Consequently, we 
have that Dom{T^{p)) U {i G [Ij't-] | Variti) = 0} is a looping mode for p. The function 
loopingjnodes of Fig.^is an application of these remarks. 

Now we have the essential material for the design of a tool that attempts proofs of optimality 
of left terminating multi-modes computed by a termination inference tool as e.g. cTI 26 or 
TerminWeb J7|. For each pair (p, 0) in the set the function optimal_tc of Fig. [5] returns, we can 
conclude that the corresponding TMp is the optimal terminating multi-mode which characterizes 
the operational behavior of p with respect to Cterm ■ 
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optimal_tc(P, max, {TMp}p^Yip)'- 

in: P: a logic program 

max: a non- negative integer 

{TMp}p^Yip: a finite set of terminating multi-modes 
out: a finite set of pairs {p, Mp) such that p G lip and 

Mp is a multi-mode for p with no information w.r.t. its left behaviour 
note: if for each p E Up, Mp — 0, then { TMp}p^Up is optimal 

1; Res := 

2; L := infer_loop_cond(P, max) 

3: for each p G lip do 

4: LMp :— looping_inodes(L,p) 

5: Mp :— modes(p) \ (less_general( TMp) U more_general(LMp)) 

6: Res :^ Res U {{p, Mp)} 

7: return Res 



Figure 2: 

Example 5.8 (Exam.ple \4-(^ continued) We apply our algorithm to the program APPENDS of Ex- 
ample \T^\ We get that 

L:={ {append{[Xi\X2],X3,[Xi\Xi]), t+), 
(appends {Xi,X2,X3,Xi), t+), 

(appends i[],X,,X2,X3), t+) } 

is a finite set of looping conditions for APPENDS (see Example \4-S\ ) with 

Dom(T^ (append)) = {2} 
Dom(T+ (appends)) = {2,3,4} 
Dom(T^ (appends)) = {3} 

So, for append we have: 

LM append '■= looping_modes(L, append) = {{2}} 
more_general(LM append) = {0:{2}} 

TM append = {{1},{3}} 

less_general(TM append) = {{!}, {3}, {1, 2}, {1, 3}, {2, 3}, {1, 2, 3}} 



Ma 



ad = {} 



For appends, we get: 

• the looping mode {2,3,4} from (appendS(Xi, X2, X^, X4),T2) and 

• the looping mode nip := {1,3} from (appends ([], Xi, X2, X^), T3 ) (notice that 3 G nip because 
Dom(T^ (appends)) = {3} and 1 G rup because of constant [] which is the first argument of 
appendS([],Xi, X2,X3).) 
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So, we have: 

LMappends ■= looping^odes{L, appeudS) = {{2, 3, 4} , {1, 3}} 
more_general{LM appends) = {0, {1}, {2}, {3}, {4}, {1, 3}, {2, 3}, {2, 4}, 

{3,4},{2,3,4}} 

TM appends = {{1 , 2}, {1, 4}} 

less_general{TM appends) = {{1, 2}, {1, 4}, {1, 2, 3}, {1, 2, 4}, {1, 3, 4}, 

{1,2,3,4}} 

M appends = { } 

Hence in both cases, we have characterized the left behaviour of the predicates by using two com- 
plementary tools. D 

5.3 An Experimental Evaluation 

We have implemented^ the algorithms presented in Sections 0] and 15.21 The binary unfoldings 
algorithm is derived from the one described in [7|, where we added time stamps to precisely 
control what is computed at each iteration. Looping modes are computed starting from the leaves 
of the call graph then moving up to its roots. The cTI termination inference tooP is detailed in 
[26[I24| . Here is the configuration we used for this experiment: Intel 686, 2.4GHz, 512Mb, Linux 
2.4, SICStus Prolog 3.10.1, 24.8 MLips. Timings in seconds are average over 10 runs. 

First we have applied them on some small programs from standard benchmarks of the termi- 
nation analysis literature |30[01l^ (predefined predicates were erased). The column opt? of Table 
^ indicates whether the result of cTI (see "5^) is proved optimal (/) or not {?). The column 
max gives the least non-negative integer implying optimality or the least non-negative integer n 
where it seems we get the most precise information from non-termination inference (i.e. for n and 
n + 1, the analyser delivers the same results). Then timings in seconds (t[s]) appear, followed by 
a pointer to a comment to the notes below. 
Notes: 

1. The predicate fold/3 is defined by: 

fold(X, [] ,X). 

fold(X, [YlYs] ,Z) :- op2(X,Y,V), f old(V, Ys ,Z) . 

When the predicate op2/3 is defined by the fact op2(A,B,C), the result of cTI is optimal. 
When the predicate op2/3 is defined by the fact op2(a,b,c), no looping mode is found and 
the result of cTI is indeed sub-optimal as the query fold(X,Y,Z) terminates. 

2. Termination proofs for mergesort require the list-size norm, while cTI applies the term-size 
norm. 

3. The result of cTI is not optimal. The analyzed program: 

p(A,B) :- q(A,C),p(C,B). 

p(A,A). 

q(a,b) . 



has finite binary unfoldings because there is no function symbol. Hence its termination is 
decidable (see |7j)- This could be easily detected at analyze time. We notice that no looping 
mode is found. But as any constant is mapped to by the term-size norm, the modes 
modes{p) remain undecided for cTI while they all terminate. 



^Available from http://www.univ-reunion.fr/~gccJ 
^Available from http://www.cs.unipr.it/cTI 
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Table 1: Some De Schreye's, Apt's, and Pliimcr's programs. 







cTI 






Optimal 




program 


top-lcvcl predicate 


term-cond 


t[s] 


opt? 


max 


t[s] 


cf. 


permute 


pcrmutc(X,Y) 


X 


0.01 


•" 


1 


0.01 




duplicate 


duplicatc(X,Y) 


X\JY 


0.01 


^ 


1 


0.01 




sum 


sum{X,Y,Z) 


XyYM z 


0.01 


■/ 


1 


0.01 




merge 


merge(X,Y,Z) 


{X KY)y z 


0.02 


/ 


1 


0.01 




dis-con 


dis(X) 


X 


0.02 


■/ 


2 


0.01 




reverse 


rcversG(X,Y,Z) 


X 


0.02 


■/ 


1 


0.01 




append 


append{X,Y,Z) 


xy z 


0.01 


•/ 


1 


0.01 




list 


list(X) 


X 


0.01 


/ 


1 


0.01 




fold 


fold{X,Y,Z) 


Y 


0.01 


7 


2 


0.01 


note 1 


Ite 


goal 


1 


0.01 


■/ 


1 


0.01 




map 


map(X,Y) 


xvy 


0.01 


■/ 


2 


0.01 




member 


member(X,Y) 


Y 


0.01 


/ 


1 


0.01 




mergesort 


mergesort(X,Y) 





0.04 


7 


2 


0.01 


note 2 


mergesort_ap 


mergesort_ap(X,Y,Z) 


Z 


0.08 


7 


2 


0.02 




naive.rev 


naive_rev(X,Y) 


X 


0.02 


■/ 


1 


0.01 




ordered 


ordered (X) 


X 


0.01 


■/ 


1 


0.01 




overlap 


overlap(X,Y) 


X AY 


0.01 


■/ 


2 


0.01 




permutation 


pcrmutation(X,Y) 


X 


0.03 


■/ 


1 


0.01 




quicksort 


quicksort(X,Y) 


X 


0.05 


/ 


1 


0.01 




select 


sclcct{X,Y,Z) 


YV Z 


0.01 


■/ 


1 


0.01 




subset 


subset{X,Y) 


X AY 


0.01 


■/ 


2 


0.01 




sum.peano 


sum{X,Y,Z) 


YV Z 


0.01 


/ 


1 


0.01 




pl2.3.1 


P(X,Y) 





0.01 


7 


1 


0.01 


note 3 


pl3.5.6 


P(X) 


X 


0.01 


■/ 


2 


0.01 




pl4.4.6a 


pcrm(X,Y) 


X 


0.02 


/ 


1 


0.01 




pl4.5.2 


s{X,Y) 





0.03 


■/ 


1 


0.01 




pl4.5.3a 


P(X) 





0.01 


■/ 


1 


0.01 




pl5.2.2 


turing(X,Y,Z,T) 





0.08 


7 


2 


0.03 


note 4 


pl7.2.9 


mult(X,Y,Z) 


X AY 


0.02 


■/ 


4 


0.03 


note 5 


pl7.6.2a 


rcach{X,Y,Z) 





0.02 


7 


1 


0.01 


note 6 


pl7.6.2b 


rcach(X,Y,Z,T) 





0.02 


7 


1 


0.01 




pl7.6.2c 


roach(X,Y,Z,T) 


Z AT 


0.02 


7 


2 


0.02 




plS.S.la 


minsort{X,Y) 


X 


0.03 


■/ 


2 


0.02 




pl8.4.1 


cvcn(X) 


X 


0.02 


■/ 


2 


0.01 




pl8.4.2 


c(X,Y) 


X 


0.05 


■/ 


3 


0.04 
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Table 2: Some middle-sized programs. 



program 


cTI 






Optimal 




1 










max 


=1 


max=2 


max=3 1 


name 


clauses 


Q% 


t[s] 


Opt% 


t[s] 


Opt% 


t[s] 


Opt% 


t[s] 


aim 


177 


48 


1.00 


46 


0.14 


68 


1.34 


74 


32.4 


bid 


50 


100 


0.14 


55 


0.02 


90 


0.08 


95 


0.50 


boyer 


136 


84 


0.30 


80 


0.03 


96 


0.22 


100 


3.66 


browse 


30 


53 


0.26 


46 


0.03 


80 


0.18 


100 


6.05 


credit 


57 


100 


0.11 


91 


0.02 


95 


0.11 


100 


4.46 


peephole 


134 


88 


1.08 


23 


0.06 


70 


3.62 


70 


406 


plan 


29 


100 


0.11 


68 


0.02 


81 


0.09 


81 


0.37 


qplan 


148 


61 


1.13 


50 


0.11 


79 


1.60 


81 


1911 


rdtok 


55 


44 


0.65 


44 


0.11 


88 


40.2 


7 


> 3600 


read 


88 


52 


1.72 


39 


0.04 


47 


0.80 


47 


10.9 


warplan 


101 


32 


0.49 


37 


0.07 


83 


0.99 


91 


21.5 



4. The analyzed program (from |3U) . p. 64) simulates a Turing machine. The result of cTI is 
optimal. 

5. With respect to the program: 

mult(0,A,0) . 

mult(s(A) ,B,C) :- mult (A,B,D) ,add(D,B,C) . 

add(0,A,A). 

add(s(A),B,s(C)) : - add(A,B,C) . 



the query mult (s(s(0)) ,A,B) is automatically detected as looping, although mult(0,A,B) 
and mult(s(0) ,A,B) do terminate. 

6. These three programs propose various definitions of the reachability relation between two 
nodes in a list of edges. For the first and the third definition, cTI is indeed optimal. For the 
second one, cTI is not optimal. 

Next, we have applied the couple of analyzers to some middle-sized Prolog programs, see Table 
13 Again, predefined predicates were all erased, while they are usually taken into account for cTI 
which of course improves the analysis. In other words, we only consider the logic programming 
skeleton of each program. The first two columns give the name of the file and its size (number of 
clauses). The fourth column indicates the running time (in seconds) of the termination analysis, 
while the third column is the ratio of predicates for which a non-false termination condition is 
computed over the total number of predicates defined in the program. For instance, cTI is able 
to show that there is at least one terminating mode for 48% of the predicates defined in the 
program ann. We ran the non-termination analyzer with 1 < max < 3 iterations. For each value 
of max, we give the running time (in seconds) and the ratio of predicates for which looping modes 
complement terminating modes. For example, with respect to the program ann, for max = 2 we 
get the full complete mode termination behavior of 68% of all the defined predicates. 

We note that when we increase max, we obtain better results but the running times also 
increase, which is fairly obvious. For max = 3, we get good to optimal results but the binary 
unfoldings approach reveals its potentially explosive nature: we aborted the analysis of rdtok 
after one hour of computation. 

In conclusion, from such a naive implementation, we were rather surprised by the quality of the 
combined analysis. Adopting some more clever implementation schemes, for instance computing 
the binary unfoldings in a demand driven fashion, could be investigated to improve the running 
times. 
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6 Related Works 

Some extensions of the Lifting Theorem with respect to infinite derivations are presented in |18| . 
where the authors study numerous properties of finite failure. The non-ground finite failure set 
of a logic program is defined as the set of possibly non-ground atoms which admit a fair finitely 
failed SLD-tree w.r.t. the program. This denotation is shown correct in the following sense. If two 
programs have the same non-ground finite failure set, then any ground or non-ground goal which 
finitely fails w.r.t. one program also finitely fails w.r.t. the other. Such a property is false when we 
consider the standard ground finite failure set. The proof of correctness of the non-ground finite 
failure semantics relies on the following result. First, a derivation is called non-perpetual if it is 
a fair infinite derivation and there exists a finite depth from which unfolding does not instantiate 
the original goal any more. Then the authors define the definite answer goal of a non-perpetual 
derivation as the maximal instantiation of the original goal. A crucial lemma states that any 
instance of the definite answer goal admits a similar non-perpetual derivation. Compared to our 
work, note that we do not need fairness as an hypothesis for our results. On the other hand, 
investigating the relationships between non-ground arguments of the definite answer and neutral 
arguments is an interesting problem. 

In |35| . the authors present a dynamic approach to characterize (in the form of a necessary 
and sufficient condition) termination of general logic programs. Their technique employs some 
key dynamic features of an infinite generalized SLDNF-derivation, such as repetition of selected 
subgoals and recursive increase in term size. 

Loop checking in logic programming is also a subject related to our work. In this area, |5| sets 
up some solid foundations. A loop check is a device to prune derivations when it seems appropriate. 
A loop checker is defined as sound if no solution is lost. It is complete if all infinite derivations 
are pruned. A complete loop check may also prune finite derivations. The authors show that even 
for function- free programs (also known as Datalog programs), sound and complete loop checks are 
out of reach. Completeness is shown only for some restricted classes of function- free programs. 

We now review loop checking in more details. To our best knowledge, among all existing 
loop checking mechanisms only OS-check jS2], EVA-check |H1] and VAF-check PH] are suitable for 
logic programs with function symbols. They rely on a structural characteristic of infinite SLD- 
derivations, namely, the growth of the size of some generated subgoals. This is what the following 
theorem states. 

Theorem 6.1 Consider an infinite SLD- derivation ^ where the leftmost selection rule is used. 
Then there are infinitely many queries Qi^, Qi2, ••• (with ii < i2 < ■ ■ ■ ) in ^ such that for 
O'lT-y j ^ 1; the selected atom Ai. of Qi is an ancestor of the selected atom A^.^j^ ofQi.^-^ and 

size{Ai._^J > size{Ai^). 

Here, size is a given function that maps an atom to its size which is defined in terms of the number 
of symbols appearing in the atom. As this theorem does not provide any sufficient condition to 
detect infinite SLD-derivations, the three loop checking mechanisms mentioned above may detect 
finite derivations as infinite. However, these mechanisms are complete w.r.t. the leftmost selection 
rule i.e. they detect all infinite loops when the leftmost selection rule is used. 

OS-check (for OverSize loop check) was first introduced by Shalin |^|31] and was then formal- 
ized by Bol ^. It is based on a function size that can have one of the three following definitions: 
for any atoms A and B, either size{A) = size(B), either .size{A) (resp. size{B)) is the count of 
symbols appearing in A (resp. B), either size{A) < size{B) if for each i, the count of symbols of 
the i-th argument of A is smaller than or equal to that of the i-th argument of B. OS-check says 
that an SLD-derivation may be infinite if it generates an atomic subgoal A that is oversized., i.e. 
that has ancestor subgoals which have the same predicate symbol as A and whose size is smaller 
than or equal to that of A. 

EVA-check (for Extented Variant Atoms loop check) was introduced by Shen 34 . It is based 
on the notion of generalized variants (if Gi and Gj {i < j) are two goals in an SLD-derivation, an 
atom A in Gj is a generalized variant of an atom A' in Gi if A is a variant of A' except for some 
arguments whose size increases from A' to A via a set of recursive clauses.) EVA-check says that 
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an SLD-derivation may be infinite if it generates an atomic subgoal A tliat is a generalized variant 
of some of its ancestor A' . Here the size function that is used apphes to predicate arguments, 
i.e. to terms, and it is fixed: it is defined as the the count of symbols that appear in the terms. 
EVA-check is more reliable than OS-check because it is less likely to mis-identify infinite loops 
|34| . This is mainly due to the fact that, unlike OS-check, EVA-check refers to the informative 
internal structure of subgoals. 

VAF-check (for Variant Atoms loop check for logic programs with Functions) was proposed by 
Shen et al. j36| . It is based on the notion of expanded variants. An atom A is an expanded variant 
of an atom A' if, after variable renaming, A becomes B that is the same as A' except that there 
may be some terms at certain positions in A' , each A'[i] . . .[k] of which grows in B into a function 
B[i] ... [fc] = /(... , A'[i] . . . [fc], . . . ) (here, we use A'[i] . . .[k] (resp. B[i] . . . [k]) to refer to the fc-th 
argument of . . . of the i-th argument of A' (resp. B)). VAF-check says that an SLD-derivation 
may be infinite if it generates an atomic subgoal A that is an expanded variant of some of its 
ancestor A' . VAF-check is as reliable as and more efficient than EVA-check |36|. 

The main difference with our work is that we want to infer atomic queries which are guaranteed 
to be left looping. Hence, we consider sufficient conditions for looping, in contrast to the above 
mentioned methods which consider necessary conditions. Our technique returns a set of queries 
for which it has pinpointed one infinite derivation. Hence, we are not interested in soundness 
as we do not care of finite derivations, nor in completeness, as the existence of just one infinite 
derivation suffices. Of course, using the A-subsumption test as a loop checker leads to a device that 
is neither sound (since A-subsumption is a particular case of subsumption) nor complete (since 
the A-subsumption test provides a sufficient but not necessary condition). This is illustrated by 
the following example. 

Example 6.2 Let c :— p{X,X) ^- p{f{X),f{X)). As the arguments of the head of c have one 
common variable X , every set of positions with associated terms t+ that is DN for {c} is such 
that the domain ofr'^ijf) is empty (see (DNl) in Definition \S.H5]) . Notice that from the head 
p{X, X) of c we get 

pix, X) =^p{f{x)j{x)) ^ • • • =^p{f^{x), r (X)) =^ • • ■ 

c c c c 

As the arguments of p grow from step to step, there cannot he any query in the derivation that is 
A[r"'"]-more general than one of its ancestors. Consequently, we can not conclude that p{X,X) 
left loops w.r.t {c}. D 

On the other hand, using loop checking approaches to infer classes of atomic left looping queries 
is not satisfactory because, as we said above, non-looping queries may be mis-identified as looping. 

Example 6.3 We cannot replace, in Corolla,rv \S.S\ the subsumption test by the expanded variant 
test used in VAF-check because, for instance, in the clause c :— p{a) <—p{f{a)), we have: p{f{a)) 
is an expanded variant ofp{a), hut p{a) does not loop w.r.t. c. 

Finally, ^U] is also related to our study. In this paper, the authors describe an algorithm for 
detecting non-terminating queries to clauses of the type p{- ■ ■) <— p{- ■ ■). The algorithm is able to 
check if such a given clause has no non-terminating queries or has a query which either loops or fails 
due to occur check. Moreover, given a linear atomic goal {i.e. a goal where all variable occurs at 
most once), the algorithm is able to check if the goal loops or not w.r.t. the clause. The technique 
of the algorithm is based on directed weighted graphs ^^ and on a necessary and sufficient 
condition for the existence of non-terminating queries to clauses of the type p{- ■ ■) ^ p{- ■ ■)■ This 
condition is proved in |H] and is expressed in terms of rational trees. 

7 Conclusion 

We have presented a extension of the subsumption test which allows to disregard some arguments, 
termed neutral arguments, while checking for subsumption. We have proposed two syntactic 
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criteria for statically identifying neutral arguments. From these results, in the second part of this 
report we have described algorithms for automating non-termination analysis of logic programs, 
together with correctness proofs. Finally, we have applied these techniques to check the optimality 
of termination conditions for logic programs. 

This paper leaves numerous questions open. For instance, it might be interesting to try to 
generalize this approach to constraint logic programming 19^. Can we obtain higher level proofs 
compared to those we give? Can we propose more abstract criteria for identifying neutral argu- 
ments? A first step in this direction is presented in 29 . Also, our work aims at inferring classes 
of atomic left looping queries, using a bottom-up point of view. Experimental data show that it 
may sometimes lead to prohibitive time/space costs. How can we generate only the useful binary 
clauses without fully computing the iterations of this Tp-like operator? Or can we adapt our 
algorithms towards a more efficient correct top-down approach for checking non-termination? 
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A Algorithms 

First, we define a pre-order relation over sets of positions with associated terms. Such a relation 
is useful for the design of the algorithms that we present in the sequel of this section. 

Definition A.l (^ and r^ax) 

• Ti ^ T2 if for each p G XI,. Dom{Ti (p)) C Dom{T2 {p)) and for each i G Dom{Ti{p)), 



^+ 



{p){i) is more general than T^(p){i). 



• '''max denotes a set of positions with associated terms s.t. Dom(T^^^{p)) — [1, arity{p)] for 
each p G n and T+^^(p)(i) is a variable for each i G [1, arity{p)]. 

A.l DN Sets of Positions with Associated Terms for Binary Programs 

We present below an algorithm for computing DN sets of positions with associated terms. 

dna.(BinProg , t^)'- 
in; BinProg: a finite set of binary clauses 



out 

1 
2 
3 

4 
5 
6 

7 



a set of positions with associated terms 
s.t. T2 ^ T^ and T2 is DN for BinProg 

= rt 

— satisfyj:)Nl{BinProg,T2) 
= satisfy_DN2( BinProg , Tj*" ) 
= satisfy_DN3( BinProg , T2 ) 

while satisfy_DN4(_BmPro(7,T2'") ^ T2 do 
T2 := satisfy_DN4(SmProg,r2^) 

return t^ 



The algorithm dna calls four auxiliary functions that correspond to conditions (DNl), (DN2) 
(DNS) and (DN4) in the definition of a DN set of positions with associated terms (see Defini- 
tion |^2S|)- These functions are detailed below. 

After T2 '■= ScLtisfyJ)Nl{BinProg,T2) at line 2 of dna, T2 satisfies item (DNl) of Defini- 
tion 1^3 

satisfy_DNl (SmPro^, t^): 

1. T2 .— Tj 

2: for each p{si, . . . , Sn) *— S G BinProg do 

E -.^ {ie [1, n] I Var{s,) n Var{{sj \ j ^ i}) = 0} 



-{p):=T+{p)\(Dom{T+{p))nE) 



return r. 



After T2 := s at i sf y_DN2 ( BmPro;?, r2^) at line 3 of dna, T2 satisfies item (DN2) of Dcfini- 
tion l3.3!3l Notice that less_general at line 5 of satisfyJDN2 is a function that returns the less 
general term of two given terms; if none of the given terms is less general than the other, then 
this function returns undefined. 
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■ , Sn) ^ B E BinProg do 



satisfy_DN2(5mPro5, t+) 

1: T+:=T+ 

2: for each p(si, 

for each i G Dom{T2 (p)) do 
Wi := less_general(si, Tj (p)(i)) 
if Wi = undefined then F := F U {i} 
else T2 {p){i) := u^ 

.+ 
2 



^2+b):=^2+(p)l(^o"^(^2+(p))\^) 
return Tn 



After 



:= s at i sf y_DN3 ( SmPro(7, r2) at line 4 of dna, T2 satisfies item (DNS) of Defini 



tion 13.351 The function satisfy_DN3 is detailed below. 

satisfyJDN3(i?mPro5, t^): 

1: r}:=rt 

for each H ^- q{ti, . . . , i,n) G BinProg do 
F := 

for each i e Dom{T2 {qj) do 
if T^ ((?)(«) is not more general than t^ then F := F U {i} 



'-2+(9):=^2+(9)l(^om(r2+(g))\F) 



return t. 



Finally, the function satisf y_DN4 is defined as follows. After line 6 of dna, the set of positions 
with associated terms T2 satisfies item (DN4) of Definition 13.351 



satisfy_DN4(BmProff, t^): 


1 


^2 •— ^1 


2 


for each p{si, . . . , s„) ^ q{ti, . . . , tm) g BinProg do 


3 


F :=0 


4 


for each i e Dom{T2 {p)) do 


5 


for each j G [l,m] \ Dom{T2 (q)) do 


6 


if Var(s,) n Far(ij) / then F := F U {i} 


7 


T2+(p):-T2+(p)|(i?om(T2+(p))\F) 


8 


return Tj'^ 



Proposition A. 2 (Correctness of dna) Let BinProg be a binary program and t^ be a set of 

positions with associated terms. 

1. satisfy_DNl{BinProg^T^), ..., satis fyJ)N4{BinProg,T^) terminate; 

2. satis fyJ)Nl{BinProg,T^) =^ r+, .... satis fy_DN4{BinProg ,t^) ^ t+; 

3. dna[BinProg,T^) terminates; 

4. dna{BinProg , t^) =4 ^i (md dna{BinProg , t^) is a set of positions with associated terms that 
is DN for BinProg . 

Proof. We have: 

1. satisf y_DNl(i?mPro5, Tj^) terminates because, as BinProg is a finite set of binary clauses, 
the loop at lines 2-4 terminates. Concerning satisf y_DN2-4, the inner loops terminate since 
for each p G 11, Dom{T2 {p)) Q [l,arity{p)\ and the outer loop terminates as BinProg is a 
finite set of binary clauses. 
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2. • satisfy_DNl(5mPro5,Tj^) =^ Tj^: 

Line 1, we start from t^ . Line 4, we have for each relation symbol p from the heads of 
the clauses of BinProg: 

Dom{T+{p)) C Dom{T+{p)) and V^ G Dom{T+{p)), t+ {p){i) = r+(p)(i) . 

Hence, when we reach line 5, we have: s at i sf y_DN 1 (iJmPro^, Tj^) =:^ t^ . 

• satisfy_DN2(i?mPro5,Ti+) =<; t^ : 

Line 1, we start from t^ . Then, for each relation symbol p from the heads of the 
clauses of BinProg and for each i e Dom{T2{p)), either T2{p){i) is set to a less general 
term than Tj (p)(i) (line 7) or i is removed from the domain of T2 (p) (lines 6 and 8). 
Therefore, when we reach line 9, we have: s at i sf y JDN2 ( SmPro;?, Tj^^) =^ t^ . 

• satisfy_DN3(5mPro5,Tj^) =^ t^ : 

Line 1, we start from t^ . Line 6, we have for each relation symbol q from the bodies 
of the clauses of BinProg: 

Dom{T+[q)) C Dom{T+[q)) and Vz £ Dom{T+{q)), t+ [q){i) = t+ [q){i) . 

Hence, when we reach line 7, we have: satisfy_DN3(i?mPro5, r^^) =<; t^ . 

• satisfy_DN4(i?mPro5,Tj^) =^ t^: 

Line 1, we start from t^ . Line 7, we have for each relation symbol p from the heads of 
the clauses of BinProg: 

Dom{T+{p)) C Dom{T+{p)) and Vi E Dom{T+{p)), T+{p){i) = r+(p)(i) . (2) 

Hence, when we reach line 8, we have: satisfy_DN4(i?mPr'05, Tj^) =:^ t^ . 

3. Each call to satisfy_DNl, . . . , satisf y_DN4 terminate. Moreover, concerning function 
satisfy_DN4, we mentioned above that (0) holds. As C is a well-founded relation over 
the set of sets, the loop at lines 5-6 terminates. 

4. Line 1, we start from t^ . Then satisfy_DNl, . . . , satisfyJDN4 weaken t^ with respect to 
Definition 13.351 When we reach the fixpoint, the property holds. D 

A. 2 Loop Dictionaries 

A. 2.1 Proof of Proposition 14731 Dage ll4l 

We proceed by induction on the length n of BinSeq. 

• Basis. If n = 0, then, as {[H ^- B],t'^) is a looping pair, B is A[T+]-more general than H 
and T+ is DN for H ^ B, i.e. A[r+] is DN for H ^ B by Theorem lOUl Consequently, by 
Proposition 13 .201 H loops w.r.t. [H ^ B]. 

• Induction. Suppose that for an n > 0, each looping pair ([H ^- B\BinSeq], r+) with BinSeq 
of length n is such that H loops w.r.t. [H ^- B\BinSeq]. 

If BinSeq is of length n + 1, it has form [Hi ^ Bi\BinSeqi\ with BinSeqi of length n. 
Moreover, as {[H ^- B\BinSeq],T^) is a looping pair, there exists a set of positions with 
associated terms t^ such that {[Hi <— Bi[BinSeqi],T^) is a looping pair and B is A[rj^]- 
more general than Hi. So, by the induction hypothesis. Hi loops w.r.t. [Hi <— Bi[BinSeqi] 
i.e. Hi loops w.r.t. BinSeq. As B is A[T-|^]-more general than Hi and A[t-|^] is DN for 
BinSeq, by Definition 13.181 i? loops w.r.t. BinSeq. Therefore, by CoroUarv 13.31 H loops 
w.r.t. [H ^ B[BmSeq]. 
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A. 2. 2 Computing a Loop Dictionary 

The top-level function for inferring loop dictionaries from a logic program is the following. It uses 
the auxiliary functions uiiit_loop and loops_f rom_dict described below. 



inf er_loop_dict(P, max): 

in: P: a logic program 

max: a non- negative integer 

a loop dictionary, each element [BinSeq, r^ 



out: 

1 
2 
3 
4: 

5 



of which 



is such that BinSeq C Tp ] max 

Diet := 

for each H ^ B eT^p] max do 
Diet := unit_loop(i7 ^- B, Diet) 
Diet := loops Jrom_dict(i7 ^- B, Diet) 

return Diet 



The function unit_loop is used to extract from a binary clause the most simple form of a 
looping pair: 

unit_loop(iy ^ B, Diet): 

in: H ^ B: & binary clause 
Diet: a loop dictionary 
out: Diet': a loop dictionary, every element {BinSeq, t^) of which is 
such that either (BinSeq, r"*") € Diet or BinSeq = [H ^ B] 

Diet' := Diet 

r+:=dna([i7^B],r+,J 

if B is A[r+]-more general than H then 



Diet' :== Diet' U {{[H 
return Diet' 



B] 



-)} 



Termination of unit_loop relies on that of dna. Partial correctness is deduced from the next 
theorem. 

Theorem A. 3 (Partial eorreetness o/unit_loopJ Let H ^ B be a binary elause and Diet be a 
loop dietionary. Then uiiit_loop(i7 <— B,Diet) is a loop dietionary, every element {BinSeq, t'^) 
of whieh is sueh that either {BinSeq, r+) G Diet or BinSeq = [H '^ B]. 

Proof. Let r+ be the set of positions with associated terms computed at line 2. If B is not 
A[T"'"]-more general than H then, at line 5 of unit_loop, we have Diet = Diet, so the theorem 
holds. 

Now suppose that B is A[T+]-more general than H. Then, at line 5 we have Diet' := Diet U 
{{[H ^- B],T'^)} where Diet is a loop dictionary, t^ is DN ior H ^ B and B is A[T+]-more 
general than H. So at line 5 Diet is a loop dictionary, every element {BinSeq, t^) of which is 
such that either (BinSeq, r+) G Diet or BinSeq ~ [H ^ B]. 

The function loops_f rom_dict is used to combine a binary clause to some looping pairs that 
have already been infered in order to get some more looping pairs. 
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loops jfrom_dict(i7 ^ B, Diet): 

in: H ^^ B: a binary clause 
Diet: a loop dictionary 
out: Diet : a loop dictionary, every element {BinSeq,T^) of which is 
such that (BinSeq,T'^) € Diet or BinSeq = [H ^- B\BinSeqi\ 
for some {BinSeqi,T^) in Diet 

1: Dzci' := Diet 

2: for each {[Hi ^ Bi\BinSeqi\,T^) £ Dzci do 

3: if S is A[r]^]-more general than Hi then 

4: r+ := dna([iJ ^ B,Hi^ Bi\BinSeqi],T+) 

5: Dici' := Diet' U {{[H ^ B , Hi ^ Bi\BinSeq^],T+)} 

6: return I?ici' 

Termination of loopsjfrom_dict follows from finiteness of Diet (because Diet is a loop dictio- 
nary) and termination of dna. Partial correctness follows from the result below. 

Theorem A. 4 (Partial eorreetness o/ loops jfrom_dictj Let H ^ B be a binary clause and Diet 
be a loop dietionary. Then loops_f rom_dict(_ff <— B,Diet) is a loop dictionary, every element 
(BinSeq, T^) of which is such that {BinSeq, t^) £ Diet or BinSeq = [_ff ^ B\BinSeqi\ for some 
{BinSeqi,T^) in Diet. 

Proof. Upon initialization at line 1, Diet is a loop dictionary. Suppose that before an iteration of 
the loop at hne 2, Diet' is a loop dictionary. Let {[Hi ^- Bi[BinSeqi],T^) G Diet. 

If the condition at line 3 is false, then Diet remains unchanged, so after the iteration Diet 
is still a loop dictionary. Otherwise, the pair {[H <— B,Hi ^- Bi[BinSeqi],T^) is added to 
Diet' at line 5. Notice that this pair is a looping one because r"*" defined at line 4 is DN for 
[H -I— B,Hi <— Bi\BinSeqi] and {[Hi ^- Bi[BinSeqi],T^) is a looping pair and B is A[rj^]-more 
general than Hi. Therefore, after the iteration. Diet' is a loop dictionary. Finally, notice that as 
Diet is a finite set, the loop at line 2 terminates. Hence, at line 6 Diet' is a finite set of looping 
pairs i.e. Diet is a loop dictionary. 

Moreover, at line 1, each element of Diet' belongs to Diet. Then, during the loop, pairs of form 
{[H ^- B[BinSeqi],T^) are added to Diet' where BinSeqi is such that there exists {BinSeqi,T^) G 
Diet. Consequently, at line 6 each element {BinSeq, t+) of Diet is such that either {BinSeq, t+) G 
Diet or BinSeq — [H '^ B\BinSeqi] for some {BinSeqi,T^) in Diet. 

Finally, here is the correctness proof of the function inf er_loop_dict. 

Theorem A. 5 (Correctness of inf er_loop_dictJ Let P be a logic program and max be a non- 
negative integer. Then, inf er_loop_dict(P, max) terminates and returns a loop dictionary, every 
element {BinSeq, r+) of which is sueh that BinSeq C Tp | max. 

Proof. At line 1, Diet is initialized to which is a loop dictionary. Suppose that before an 
iteration of the loop at line 2, Diet is a loop dictionary. Then at lines 3 and 4 unit_loop and 
loops Jrom_dict fuUfil their specifications. Hence, the call to these functions terminates and after 
the iteration Diet is still a loop dictionary. Finally, as Tp "f max is a finite set, the loop at line 2 
terminates and at line 5 Diet is a loop dictionary. 

Moreover, at line 1 each element {BinSeq, r+) of Diet is such that BinSeq C Tp | max. Then, 
during the loop, unit_loop and loops_f rom_dict are called with clauses from Tp f max. So, by 
Theorem I A . 31 and Theorem lA.4l after the iteration each element {BinSeq, r+) of Diet is such that 
BinSeq C Tp } max. 

A. 3 Looping Conditions 

The following function computes a finite set of looping conditions for any given logic program. 
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inf er_loop_cond(P, max): 

in: P: a logic program 

max: a non- negative integer 
out: a finite set of looping conditions for P 

1: L:=0 

2: _Dict := inf er_loop_dict(P, TTiaa;) 

3: for each {[H ^ B\BinSeq],T+) e Diet do 

4: L:^LU{{H,T+)} 

5: return L 

A call to inf er_loop_cond(P, max) terminates for any logic program P and any non-negative 
integer m,ax because inf er_loop_dict(P, 77202;) at line 2 terminates and the loop at line 3 has a 
finite number of iterations (because, by correctness of inf er_loop_dict, Diet is finite.) Partial 
correctness of inf er_loop_cond follows from the next theorem. 

Theorem A. 6 (Partial correctness o/inf er_loop_condJ Let P be a logic program and max be a 
non-negative integer. Then inf er_loop_cond(P, max) is a finite set of looping conditions for P. 

Proof. By correctness of inf er_loop_dict, Diet is a loop dictionary. 

Let {[H ^- B\BinSeq],T^) e Diet. Then {[H ^- B\BinSeq],T^) is a looping pair. Conse- 
quently, by Proposition 14.31 H loops w.r.t. [H ^- B\BinSeq]. As t+, hence A[r"''], is DN for 
[H ^- B\BinSeq], by Definition 13.181 everv atom that is A[r+]-more general than H loops w.r.t. 
[H ^ B\BinSeq]. 

As {[H ^ B\BinSeq],T+) G Diet, by Theorem [A. 51 we have 

[H ^- B\BinSeq] C Tp 1 max C bin_unf{P) . 

So, by Theorem l2.1l H left loops w.r.t. P and every atom that is A[r+]-more general than H left 
loops w.r.t. P. So, {H,T^) is a looping condition for P. Consequently, at line 5, L is a finite set 
of looping conditions for P because, as Diet is finite, the loop at line 3 iterates a finite number of 
times. 

B Proofs 

B.l Two Useful Lemmas 

Lemma B.l Let c := H ^ B be a binary elause. Then, for every variant cy of c such that 
Varlcy) fl Var{H) = 0, we have H ^^ Bj' where 7' := 7I Var{B) \ Var{L[). 

c 

Proof. Let jjl :~ {x^/x\x G Var{H)}. By Claim lR2l below, /i is an mgu of H^ and H . Hence, as 
Var{c'y) n Var{H) — 0, we have the left derivation step H =^ B^fi where cy is the input clause 

c 

used. 

If Var{B) = 0, then Bj^i = Bj', so we have H ^ B^' i.e. H =^ B^' . 

c c 

If Var{B) ^ 0, take a variable x G Var{B): 

• if X G Var{H), then x{'y^) — {xj)ii = a; by definition of fi, 

• if X ^ Var{H), then x{"/ii) = {xj)fi = xj by definition of fi. 
Hence, Bjjj, — B-f', so we have H =^ P7' i.e. H =^ B-f'. 

c c 

Claim B.2 /i is an mgu of LL^ and H. 
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Proof. Let p{si, . . . , s„) := H. The set of unifiers of Hj and H is the same as that of Ei :— 
{si7 = si, . . . , s„7 = s„}. Let E2 := {2:7 = x | x G Var(_ff)}. Notice that, as 7 is a renaming, 
if a;,y £ Var{H) then x ^ y ^ xj ^ 2/7. Moreover, for each x G Var(H), xj ^ x because 
Var(c7) fl Var{H) — 0. So, _E2 is solved. Consequently, by Lemma 2.15 page 32 of PP, fj, is an 
mgu of E2- Notice that, by Claim lR3l below, the set of unifiers of Ei is that of E2- So /i is an 
mgu of El i.e. 11 is an mgu of H"f and H. 

Claim B.3 Ei and E2 have the .same .set of unifiers. 

Proof. Let 9 he a. unifier of Ei. Let x G Var{H) and let i G [!,"] such that x E Var{si). Then 
5^76* — SiO, so, if Xk is an occurrence of x in Si, we have Xk^9 = x^O i.e. {xkj)0 — XkO. As x 
denotes any variable of H, we conclude that 9 is a unifier of E2. Conversely, let 6* be a unifier of 
E2. Then, for each i G [l,n], (5^7)6* = Si9 by definition of E2. Hence, is a unifier of Ei. 

Lemma B.4 Let c :~ H <— B he a binary clause, 07 be a variant of c such that Var(cj) H 
Var(H) — andy :— 7I Var{B)\ Var{H). Then, there exists a renaming ^" such that Bj' = Bj" . 

Proof. Let A := {x \ x G Ran{'~f') and x ^ Dom{'y')} and B := {x \ x G Dom{'j') and x ^ 
Ran{j')}. Notice that Ran{'y') and Dom{'-f') have the same number of elements, so A and B have 
the same number of elements. Let cr be a 1-1 and onto mapping from A to B. Then, 7" := 7' U cr 
is a well-defined substitution, is such that Dom{j") ~ Ran{'j"), is 1-1 and is onto. Consequently, 
7" is a renaming. 

Now, let us prove that B-f' — Bj". If Var{B) — 0, then the result is straightforward. 
Otherwise, let x G Var{B). 



• 



• 



If X e Var{H) then, as Dom{'-f') C Var{B) \ Var{H), we have x ^ Dom{'-f'), so x'j' = x. 
Moreover, xj" = x{'-f'Ua) = xa = x because Dom{a) C Ran{'-f') and Ran{j')n Var{H) = 0. 
Consequently, we have X7' = X7". 

If a; ^ Var{H) and x G Dom{'-f') then 1-7" = x{j' U cr) = xj'. 

• If a; ^ Var{H) and x ^ Dom(y) then xj' = x. 

Now, suppose that x G Dom{a). Then, as Dom(a) C Ran{j') C Ran{j), we have x G 
Ran^-y). But, as 7 is a renaming, Ran('j) — Dom{'^), so we have x G Dom{j). As 
X £ Var(i3), as x ^ Var{H) and as 7' := 'y\Var{B) \ Var{H), we have x G Dom{y). 
Contradiction. 

Consequently, x ^ Dom{a), so xa — x and X7" = x(7' U cr) = xct = x. Finally, we have 
proved that X7' — X7". 

B.2 Proof of Corollary EI21 page H 

By Lemma |B . 1 1 and Lemma |B. 41 we have H =^ Bj" where 7" is a renaming. As by hypothesis 

c 

B is more general than H, then Bj" is more general than H. Therefore, by the One Step Lifting 
Lemma [3. II H loops w.r.t. {c}. 



B.3 Proof of Corollary EI31 page H 



By Lemma IB. II and Lemma IB. 41 we have H => Bj" where 7" is a renaming. As Bj" is more 

c 

general than B and as B loops w.r.t. P, then, by the One Step Lifting Lemma f3. II Bj" loops 
w.r.t. P, so H loops w.r.t. P. 
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B.4 Proof of Proposition 13.161 page [7| 

If A is A-more general than B, we have, for a substitution tj: 

A =_p(si,. .. ,s„) 

B=p{ti,...,tn) 

Vz e [1, n] \ Dom{A{p)), U = SiT] 

Vi e Dom{A{p)), A(p)(i)(si) = true. 

Let A' be a variant of A. Then, there exists a renaming 7 such that ^' — A7. As for each 
i e _Doto(A(p)), A(p)(j) is a variant independent term-condition, we have: 

A' = _p(si7, . . . , s„7) 

B=|7(il,...,t„) 

Vi e [l,n] \ Dom{A{p)), U = Si-q = (s,:7)(7~"^?7) 
Vi e Dom{A{p)), A{p){i){si^) = true . 

Consequently, A' is A-more general than B for ^~^rj, i.e. A' is A-more general than B. 
B.5 Proof of Proposition IH^. 171 page 171 

<= By definition. 

=^ Let p{si, . . . , Sn) '■= A and p(ii, . . . , tn) := -B. As A is A-more general than B, there exists a 
substitution a such that A is A-more general than B for a. Notice that A is also A-more 
general than B for the substitution obtained by restricting the domain of a to the variables 
appearing in the positions of A not distinguished by A. More precisely, let 

T] := a\Var{{s^ | i £ [1, n] \ Dom.{A{p))}) . 

Then, 

Dom{ri) C Var{A) (3) 

and A is A-more general than B for rj. 

Now, let X G Dom{ri). Then, there exists i G [1,«] \ Dom{A{p)) such that x G yar(si). 
As A is A-more general than B for ry and i G [Ij?^] \ i^ow(A(p)), we have ti — Siij. So, as 
X G yar(si), XT? is asubtermof ti. Consequently, Var{xri) C yar(ii), so yar(a;?7) C Var{B). 

So, we have proved that for each x G Dom^q), Var{xvi) C yar(i3), i.e. we have proved that 

Ran{-q) C yar(B) . (4) 

Finally, (0) and Q imply that Dom{ri) U Ran{ri) C V^ar(74, i?) i.e. that 

yar(?7) C yar(yl, B) . 

B.6 Proof of Proposition l^.2()|. page 151 

By Lemma FB . 1 1 and Lemma [6.41 wc have H =^ Bj" where 7" is a renaming. As by hypothesis B 

c 

is A-more general than H, then by Proposition 13. 161 B^" is A-more general than H. Therefore, 
as A is DN for c, by Definition 13. 181 H loops w.r.t. {c}. 

B.7 Proof of Proposition I3.37J. page [TT] 

Let c := p{si, . . . ,s„) ^- q{ti, . . . ,*„) and c' := p{s[, . . . ,s'j) ^ q{t[, . . . ,i^) be a variant of c. 
Then, there exists a renaming 7 such that c' = 07. 
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(DNl) Let i e Dom{T^{p)). Suppose that there exists j ^ i such that Var{s'^) n Var{s'A ^ 
and let us derive a contradiction. 

Let x' G Var{s'^) n Var{s'j). As s'^ = Sjj, there exists x G Var{sj) such that x' — xj. 

For such an x, as j ^ i and as Var{si) fl yar(sj) = (because t+ is DN for c), we 
have X ^ Var{si). So, as 7 is a 1-1 and onto mapping from its domain to itself, we have 
xj ^ Var{sij)^ , i.e. x' ^ Var{s[). Contradiction! 

Consequently, Var{s[) n Var{s'j) — 0. 

(DN2) Let {i ^^ Ui) e T+(p). As s^ is more general than Ui (because r+ is DN for c) and as s[ 
is a variant of .s^, sj- is more general than Ui. 

(DNS) Let (j >-^ Uj) G T+((3'). As ij is an instance of Uj (because r+ is DN for c) and as i' is a 
variant of ij , i'- is an instance of Uj . 

(DN4) Let i G Dom{T^{p)). Suppose there exists j ^ Dom{T^{q)) such that Far(s^)nFar(t^) / 
0. Let us derive a contradiction. 

Let cc' G Var{s'^)r\Var{t'j). As i^- = ij7 and x' G V'ar(i^), there exists x G Var(ij) such that 
x' = xj. For such an a;, as the elements oiVar{si) only occur in those tk s.t. fc G Dom{T^{q)) 
(because t+ is DN for c) and as a; G yar(tj) with j ^ Doni{T^{q)), we have x ^ T^ar(si). 
So, as 7 is a 1-1 and onto mapping from its domain to itself, we have xj ^ Var^Sij) (see 
footnote 13, i.e. x' ^ Var{s'^). Contradiction! So, for each j ^ Dom{T~^{q)), we have 
Var{s'^)nVar{t'j) =0. 

Finally, we have established that r+ is DN for c'. 

B.8 DN Sets of Positions with Associated Terms Generate DN Filters 

In this section, we give a proof of Thcorcm l3.39l page 1 121 

B.8.1 Context 

All the results of this section are parametric to the following context: 

• P is a binary program and r+ is a set of positions with associated terms that is DN for P, 

• Q => Qi is a left derivation step where 

c 

- ceP, 

- Q :=p(ti,...,i„), 

- Ci := p(si, . . . , Sn) <— i? is the input clause used (consequently, ci is a variant of c that 
is variable disjoint from Q), 

• Q' := p(ii, . . . ,t'^) is A[T+]-more general than Q i.e., by Proposition 13.171 there exists a 
substitution 77 such that Var{rj) C Var{Q,Q') and Q' is A[T+]-more general than Q for 77. 



^ Because if 0:7 S yar(si7), then either x G V'ar(si), either 0:7 S yar(si) and (2:7)7 = ^'7- The former case is 
impossible because we said that x Var{si). The latter case is impossible too because (2:7)7 = x-y implies that 
xj Dom{'y) i.e. x Dom{f) (because 7 is a 1-1 and onto mapping from its domain to itself); so, x = xf i.e., as 
xj £ Var{.Si), x € Var(si). 



36 



B.8.2 Technical Definitions and Lemmas 

Definition B.5 (Technical Definition) Let c'l :— p{s'i, . . . , s^) ^ B' he a binary clause such that 

• Var{c\) n Var{Q, Q') = and 

• Ci = 0^7 for some renaming 7 satisfying Var{'^) C Var{ci,c'-^). 

As c'l is a variant of ci and ci is a variant of c, then c'l is a variant of c. Moreover, as t+ 
is DN for c, by Proposition \3.3T[ t+ is DN for c'l . So, by (DN2) in Definition \3.35\ for each 
{i <—>■ Ui) G T^(p) there exists a substitution 5i such that Ui = s[5i. 

Moreover, as p{t'i, . . . , t^) is A[r+]-more general than p(ti, . . . , t„), for each {i ^^ Ui) G T+(p), 
t[ is an instance of Ui. So, there exists a substitution 5[ such that t[ — UiS[. 

For each i G Dom{T~^{p)), we set 

a/il {SMVaris'^ . 
Moreover, we set: 

def I I 

cr = [J (7i . 



Lemma B.6 The set a of Definition XB .m is a well-defined substitution. 

Proof. Notice that, as t+ is DN for c'^, by (DNl) in Definition 13.351 we have 

Vi e Dom{T+{p)), yj e [l,n] \ {i}, Var{s'^ n Var{s'j) = . 

Consequently, 

Vi, j G Dom{T^{p)), i ^ j ^ Dom[ai) fl Dom{aj) — . 

Moreover, for each i G Dom{T^{p)), ai is a well-defined substitution. So, cr is a well-defined 
substitution. 

Lemma B.7 (Technical Lemma) Let c'l :— p{si, . . . , s'^) ^ B' he a binary clause such that 

• Var{c'i) n Var{Q, Q') = and 

• ci — c']^7 for some renaming 7 satisfying Var^^^) C Var{ci,c'i). 



Let a be the substitution of Definition |j?.5l Then, the substitution arj^O is a unifier of 
p{t[,...,t'J andp{s[,...,s'J. 

Proof. The result follows from the following facts. 

• For each {i t^ Ui) G T+(p), we have: 

S^CT = S^CTj = s[5^5[ = {,s[6i)6[ = u^5[ = t'i 

and t'^a = t[ because Dom{a) C Var{c'i) and Var{Q') n Var{c[) = 0. So, s'cr = i'cr and 
s[arjj0 = t'^arjjO. 

• For each i G [1, n] \ Dom{T^{p)), we have: 

4vid = is'tVhS = s'aO = (s',7)6' = 5,6* 

and 

t^md = (i:r,)70 = i,70 = (t,7)^ = i^^ 

and SiO = tiO because 6* is a unifier of p(si, . . . , s„) and p(ii, . . . , tn) (because Q =^ Qi with 

c 

ci as input clause used). So, 

■<VlO = t^r,7^ (5) 
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• For each i £ [l,n]\ Doni{T^(p)), we also have: 



— s'^a = s'^ because Dom{a) = Var(^{s'j \ j G Dom{T^ (p))}) and, by (DNl) m Dcfini- 
tioiiESni Var{{s'^ I j e Dom{T+{p))}) nVar{s'^) = 0; 

- t'^a = t[ because Dom{a) C Var{c[) and Var{Q') n Var{c[) = 0. 

Therefore, because of (jHl, s'^arj'^O — t'^a-q^O. 

B.8.3 A-Propagation 

Now we extend, in the case of left derivations with atomic queries and binary clauses, the following 
Propagation Lemma that is proved by Apt in 1 p. 54-56. 

Lemma B.8 (Propagation) Let G, Gi, G' and G'l be some queries such that 

G=>Gi and G'=^G[ and 

c c 

• G is an instance of G' 

• in G and G' atoms in the same positions are selected. 
Then, Gi is an instance of G'l . 

First we establish the following result. 

Lemma B.9 Suppose there exists a left derivation step of form Q' ^=^ Q'l where the input clause 

c 

is c'l such that Var{Q) fl Var(c'i) = 0. Then, Q'l is A[T+]-more general than Qi. 

Proof. Notice that we have 

Var{Q) n Var{ci) = Var{Q, Q') n Var{c[) = . 

Moreover, as ci is a variant of c\ , there exists a renaming 7 such that 

Var{'-^) C Var{ci,c'i) and ci = c'^'y . 

Let c'l := p(s;, . ..,s'^)^ B' . Then, 

Qi = BO and Qi ^ B'O' . 

T+ is DN for c and c^ is a variant of c. So, by Proposition 13.371 t+ is DN for c^. Let a be the 
substitution of Definition IB. 51 

Let q[v[, . . . ,i;^jj := B' . As B = B'^, B has form q{vi, . . . ,Vm)- 

• For each {j i-^ Uj) G T^{q), v'j is an instance of Uj (because t+ is DN for c^ and (DN3) in 
Definition 13.351 ) 

• For each j £ [^,rn] \ Dom{T^{q)) we have: 

v'jarjjB — {v'ja)rj^6 — v'^rj^O 
because, by (DN4) in Definition 13. 351 

Var{v'j) n Var{{s[ \ i £ Dom{T+{p))}) = 
with Do7n{a) = Var(^{s^ \ i e Dom{T^{p))}y Moreover, 

v'jii'j9 = {Vjrj)j6 = Vjj9 
because Varii^) C Var{Q,Q') and Var{c[) n Var{Q,Q') = 0. Finally, 

because B = B'^. 
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Consequently, we have proved that 

q{v'i, . . . , v'^) is A[T+]-more general than q{vi, . . . , Vm)S for arj'yd 

i.e. that B' is A[T+]-more general than B9 for (77776' i.e. that 

_B' is A[T^]-niore general than Qi for 0-7776' . (6) 

But, by the Technical Lemma [6.71 CT7/76' is a unifier oi p{s'i, . . . , s'^) and p{t'i, . . . , t'„). As 9' is an 
mgu of p{s'i, . . . , s'j) and p{t[, . . . , i'„) (because Q' ^=^ Q'l with c'^ as input clause), there exists 6 

c 

such that cr7;76' = d'6. Therefore, we conclude from (jHJl that B' is A[r+]-more general than Qi 
for 9'6 which implies that B'9' is A[T"'"]-more general than Qi for S i.e. that Q[ is A[r+]-more 
general than Qi for (5. Finally, we have proved that Q'l is A[T+]-more general than Qi. 
Using the Propagation Lemma FB. 81 the preceding result can be extended as follows. 



Proposition B.IO (A- Propagation) Suppose there exists a left derivation step Q' ^=^ Q'^. Then 

c 

Q'l is A[T+]-77iore general than Qi. 

Proof. Let c[ be the input clause used in Q' =^ Q[ . Take a variant Q" of Q such that 

c 

Var{Q") r\Var{c[) = 

and a variant c'/ of c such that 

Var{c'l) n Var{Q") = . 

Then, the left resolvent Q'( of Q" and c exists with the input clause c". So, for some 9" , we have 
Q"^=^Qi with input clause c'{. Consequently, we have: 

c 

Q=^Qi and Q"=^Q'l. 

c c 

Q and Q" are instances of each other because Q" is a variant of Q. So, by the Propagation 
Lemma IB . 81 used twice, Q'l is an instance of Qi and Qi is an instance of Q". So, 

Q'l is a variant of Qi . (7) 

But we also have 

Q"^Q'l and Q'^Q'^ 

c c 

with input clauses c" and c'j^, with Q' that is A[r+]-more general than Q" (because Q" is a variant 
of Q and Q' is A[T+]-more general than Q) and Var{Q") fl V^ar(c'j) = 0. So, by Lemma lR9l 

Q'l is A [t^] -more general than Q'l . (8) 

Finally, from ^ and © we have: Q'l is A[r+]-more general than Qi. 

B.8.4 Epilogue 

Theorem 13. 391 is a direct consequence of the following result. 

Proposition B.ll (One Step A-Lifting) Let c' he a variant of c variable disjoint with Q' . Then, 
there exist 9' and a query Q'l that is Afr+l-more general than Qi such that Q' =^Q'i with input 

c 

clause c' . 
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Proof. Let c[ :— p{s'i, . . . , s^) ^ B' he a variant of ci. Then there exists a renaming 7 such that 
Var{-^) C Var{ci,Ci) and ci = c'l^. Suppose also that 

Var{c[)r\Var{Q,Q') = . 

By the Technical Lemma lB.71 p{s[, . . . , s'^^) and p{t[, . . . jt'^) unify. Moreover, as Var{c'i) n 
Var{Q') = 0, p{s'i, . . . , s^) and p{t[, . . . , t'^) are variable disjoint. Notice that the following claim 
holds. 

Claim B.12 Suppose that the atoms A and H are variable disjoint and unify. Then, A also 
unifies with any variant H' of H variable disjoint with A. 

Proof. For some 7 such that Dom{j) C Var{H'), we have H = H'j. Let 6* be a unifier of A and 
H. Then, Ajd ^ AO = HO = H'-f9, so A and H' unify. 

Therefore, as c' is a variant of c'l and c' is variable disjoint with Q', p{t'i, . . . , t'^) and the head 
of c' unify. As they also are variable disjoint, we have 

c 

for some 6' and Q'j^ where c' is the input clause used. Moreover, by the A-Propagation Proposi- 
tion ^^TOj Q[ is A[T+]-more general than Qi. 
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